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TITLE OF INVENTION 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 

5 REFERENCE TO REIATED APPLICATIONS 

This application claims the benefit of U,S. 
Provisional Application No. 60/110,427, filed December 1, 1998, 
U.S. Provisional Application No. 60/110, 438, filed December 1, 
1998, U.S. Provisional Application No. 60/110,339, filed December 
10 1, 1998, U.S. Provisional Application No. 60/110,428, filed 
December 1, 1998,. U.S. Provisional Application No. 60/110,340, 
filed December 1, 1998. 

FIELD OF INVENTION 

15 The present invention relates to Chlamydia antigens 

and corresponding DNA molecules, which can be used to prevent 
and treat Chlamydia infection in mammals, such as humans. 

BACKGROUND OF THE INVENTION 

20 Chlamydiae are prokaryotes. They exhibit morphologic 

and structural similarities to gram-negative bacteria including 
a trilaminar outer membrane, which contains lipopolysaccharide 
and several membrane proteins that are structurally and 
functionally analogous to proteins found in E coli. They are 

25 obligate intra-cellular parasites with a unique biphasic life 
cycle consisting of a metabolically inactive but infectious 
extracellular stage and a replicating but non-infectious 
intracellular stage. The replicative stage of the life-cycle 
takes place within a membrane-bound inclusion which sequesters 

30 the bacteria away from the cytoplasm of the infected host cell. 

C- pneumoniae is a common human pathogen, originally 
described as the TWAR strain of Chlamydia p^ittaci but 
subsequently recognised to be a new species. C. pneimoniae is 
antigenically, genetically and morphologically distinct from 
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other chlamydia species (C. trachomatis, C. pecorum and 

C- psittaci) . It shows 10% or less DNA sequence homology with 

either of C. trachomatis or C, psittaci. 

C. pnexmoniae is a common cause of community acquired 
5 pneumonia, only less frequent than Streptococcus pneumoniae and 
Mycoplasma pneumoniae (Grayston et al. (1995) Journal of 
Infectious Diseases 168:1231; Campos et ai. (1995) Investigation 
of Ophthalmology and Visual Science 36:1477). It can also cause 
upper respiratory tract symptoms and disease, including 
10 bronchitis and sinusitis (Grayston et al. (1995) Journal of 
Infectious Diseases 168:1231; Grayston et al (1990) Journal of 
Infectious Diseases 161:618; Marrie (1993) Clinical Infectious 
Diseases. 18:501; Wang et al (1986) Chlamydial infections). 
Cambridge University Press, Cambridge, p. 329The great majority 
15 of the adult population (over 60%) has antibodies to 
C. pneumoniae (Wang et al (1986) Chlamydial infections. 
Cambridge University Press, Cambridge, p. 329), indicating past 
infection which was unrecognized or asymptomatic. 

C. pneumoniae infection usually presents as an acute 
20 respiratory disease (i.e., cough, sore throat, hoarseness, and 
fever; abnormal chest sounds on auscultation) - For most 
patients, the cough persists for 2 to 6 weeks, and recovery is 
slow. In approximately 10% of these cases, upper respiratory 
tract infection is followed by bronchitis or pneumonia. 
25 Furthermore, during a C. pneumoniae epidemic, subsequent 

co-infection with pneumococcus has been noted in about half of 
these pneumonia patients, particularly in the infirm and the 
elderly- As noted above, there is more and more evidence that 
C. pneumoniae infection is also linked to diseases other than 
30 respiratory infections. 

The reservoir for the organism is presumably people. 
In contrast to C. psittaci infections, there is no known bird or 
animal reservoir. Transmission has not been clearly defined. 
It may result from direct contact with secretions, from fomites. 
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or from airborne spread- There is a long incubation period, 
which may last for many months. Based on analysis of epidemics, 
C. pneumoniae appears to spread slowly through a population 
(case-to-case interval averaging 30 days) because infected 
5 persons are inefficient transmitters of the organism. 

Susceptibility to C. pneumoniae is universal. Reinfections 
occur during adulthood, following the primary infection as a 
child. C. pneumoniae appears to be an endemic disease 
throughout the world, noteworthy for superimposed intervals of 

10 increased incidence (epidemics) that persist for 2 to 3 years. 
C. trachomatis infection does not confer cross-immunity to 
C. pneumoniae. Infections are easily treated with oral 
antibiotics, tetracycline or erythromycin (2 g/d, for at least 
10 to 14 d) . A recently developed drug, azithromycin, is highly 

15 effective as a single-dose therapy against chlamydial 
infections. 

In most instances, C. pneumonia^ infection is often 
mild and without complications, and up to 90% of infections are 
subacute or unrecognized. Among children in industrialized 

20 countries, infections have been thought to be rare up to the age 
of 5 y, although a recent study (E Nermann et al. Chlamydia 
pnexamoniae in children with acute respiratory tract infections. 
Acta Paediatrica, 1998, Vol 87, Iss 1, pp 23-27) has reported 
that many children in this age group show PGR evidence of 

25 infection despite being seronegative, and estimates a prevalence 
of 17-19% in 2-4 y olds. In developing countries, the 
seroprevalence of C. pneumoniae antibodies among young children 
is elevated, and there are suspicions that C. pneumoniae may be 
an important cause of acute lower respiratory tract disease and 

30 mortality for infants and children in tropical regions of the 
world. 

From seroprevalence studies and studies of local 
epidemics, the initial C. pneumoniae infection usually happens 
between the ages of 5 and 20 y. In the USA, for example, there 
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are estimated to be 30,000 cases of childhood pneumonia each 
year caused by C. pneumoniae. Infections may cluster among 
groups of children or young adults (e.g., school pupils or 
military conscripts) . 
5 C. pneumoniae causes 10 to 25% of community-acquired 

lower respiratory tract infections (as reported from Sweden, 
Italy, Finland, and the USA) . During an epidemic, C. pneumonia 
infection may account for 50 to 60% of the cases of pnexamonia. 
During these periods, also, more episodes of mixed infections 
10 with S. pneumoniae have been reported. 

Reinfection during adulthood is common; the clinical 
presentation tends to be milder. Based on population 
seroprevalence studies, there tends to be increased exposure 
with age, which is particularly evident among men. Some 
15 investigators have speculated that a persistent, asymptomatic C. 
pneumoniae infection state is common. 

In adults of middle age or older, C. pneumoniae 
infection may progress to chronic bronchitis and sinusitis. A 
study in the USA revealed that the incidence of pneumonia caused 
20 by C. pneumoniae in persons younger than 60 years is 1 case per 
1,000 persons per year; but in the elderly, the disease 
incidence rose three-fold. C. pneumoniae infection rarely leads 
to hospitalization, except in patients with an underlying 
illness. 

25 Of considerable importance is the association of 

atherosclerosis and C. pneumoniae infection. There are several 
epidemiological studies showing a correlation of previous 
infections with C. pneumoniae and heart attacks, coronary artery 
and carotid artery disease (Saikku et al. (1988) Lancet; ii:983; 

30 Thorn et al. (1992) JAMA 268:68; Linnanmaki et al. (1993), 
Circulation 87:1030; Saikku et al. (1992) Annals Internal 
Medicine 116:273; Melnick et al (1993) American Journal of 
Medicine 95:499). Moreover, the organisms has been detected in 
atheromas and fatty streaks of the coronary, carotid, peripheral 
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arteries and aorta (Shor et al. (1992) South African. Medical 
Journal 82:158; Kuo et al. (1993) Journal of Infectious Diseases 
167:841; Kuo et al, (1993) Arteriosclerosis and Thrombosis 
13:1500; Campbell et al (1995) Journal of Infectious Diseases 
5 172:585; Chiu et al. Circulation, 1997 (In Press)). Viable 

C- pneumoniae has been recovered from the coronary and carotid . 
artery (Ramirez et al (1996) Annals of Internal Medicine 
125:979; Jackson et al. Abst. K121, p272, 36^^ ICAAC, 15-18 Sept. 
1996, New Orleans) . Furthermore, it has been shown that 

10 C. pneumoniae can induce changes of atherosclerosis in a rabbit 
model (Fong et al (1997) Journal of Clinical Microbiolology 
35:48). Taken together, these results indicate that it is 
highly probable that C. pneumoniae can cause atherosclerosis in 
humans, though the epidemiological importance of chlamydial 

15 atherosclerosis remains to be demonstrated. 

A number of recent studies have also indicated an 
association between C. pneumoniae infection and asthma. 
Infection has been linked to wheezing, asthmatic bronchitis, 
adult-onset asthma and acute exacerbations of asthma in adults, 

20 and small-scale studies have shown that prolonged antibiotic 
treatment was effective at greatly reducing the severity of the 
disease in some individuals (Hahn DL, et ai. Evidence for 
Chlamydia pneumoniae infection in steroid-dependent asthma. 
Ann Allergy Asthma Immunol. 1998 Jan; 80(1): 45-49.; Hahn DL, et 

25 ai. Association of Chlamydia pnexamoniae IgA antibodies with 
recently symptomatic asthma. Epidemiol Infect. 1996 Dec; 
117(3): 513-517; Bjornsson E, et al. Serology of chlamydia in 
relation to asthma and bronchial hyperresponsiveness. Scand J 
Infect Dis. 1996; 28(1): 63-69.; Hahn DL. Treatment of Chlamydia 

30 pneumoniae infection in adult asthma: a before-after trial. J 
Fam Pract. 1995 Oct; 41(4): 345-351.; Allegra L, et al. Acute 
exacerbations of asthma in adults: role of Chlamydia pneumoniae 
infection. Eur Respir J. 1994 Dec; 7(12): 2165-2168.; Hahn DL, 
et aJ. Association of Chlamydia pneumoniae (strain TWAR) 
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infection with wheezing, asthmatic bronchitis, and adult-onset 
asthma. JAMA. 1991 Jul 10; 266(2): 225-230). 

In light of these results a protective vaccine against 
C. pneumoniae infection would be of considerable importance. 
5 There is not yet an effective vaccine for any human chlamydial 
infection. It is conceivable that an effective vaccine can be 
developed using physically or chemically inactivated Chlamydiae. 
However, such a vaccine does not have a high margin of safety. 
In general, safer vaccines are made by genetically manipulating 
10 the organism by attenuation or by recombinant means. 

Accordingly, a major obstacle in creating an effective and safe 
vaccine against human chlamydial infection has been the paucity 
of genetic information regarding Chlamydia, specifically 
C. pneumoniae. 

15 Studies with C. trachomatis and C. psittaci indicate 

that safe and effective vaccine against Chlamydia is an 
attainable goal. For example, mice which have recovered from a 
lung infection with C. trachomatis are protected from 
infertility induced by a subsequent vaginal challenge (Pal et 

20 ai.(1996) Infection and Immunity . 64 : 5341) . Similarly, sheep 
immunized with inactivated C. psittaci were protected from 
subsequent chlamydial-induced abortions and stillbirths (Jones 
et al. (1995) Vaccine 13:715). Protection from chlamydial 
infections has been associated with Thl immune responses, 

25 particularly the induction of INFg - producing CD4+T-cells 
(Igietsemes et al. (1993) Immunology 5:317). The adoptive 
transfer of CD4+ cell lines or clones to nude or SCID mice 
conferred protection from challenge or cleared chronic disease 
(Igietseme et al (1993) Regional Immunology 5:317; Magee et al 

30 (1993) Regional Immunology 5: 305), and in vivo depletion of 
CD4+ T cells exacerbated disease post-challenge (Landers et al 
(1991) Infection & Immunity 59:3774; Magee et al (1995) 
Infection & Immunity 63:516). However, the presence of 
sufficiently high titres of neutralising antibody at mucosal 
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surfaces can also exert a protective effect (Cotter et al. 
(1995) Infection and Immunity 63:4704). 

Antigenic variation within the species C. pneumoniae 
is not well documented due to insufficient genetic information, 
5 though variation is expected to exist based on C. trachomatis. 
Serovars of C. trachomatis are defined on the basis of antigenic 
variation in the major outer membrane protein (MOMP) , but 
published C. pneumoniae MOMP gene sequences show no variation 
between several diverse isolates of the organism (Campbell et al 
10 (1990) Infection and Immunity 58:93; McCafferty et al (1995) 
Infection and Immunity 63:2387-9; Knudsen et al (1996) Third 
Meeting of the European Society for Chlamydia Research, Vienna) . 
Regions of the protein known to be conserved in other chlamydial 
MOMPs are conserved in C. pneumoniae (Campbell et al (1990) 
15 Infection and Immunity 58:93; McCafferty et al (1995) Infection 
and Immunity 63:2387-9) . One study has described a strain of C. 
pneumoniae with a MOMP of greater that usual molecular weight, 
but the gene for this has not been sequenced (Grayston et al. 
(1995) Journal of Infectious Diseases 168:1231). Partial 
20 sequences of outer membrane protein 2 from nine diverse isolates 
were also found to be invariant (Ramirez et ai (1996) Annals of 
Internal Medicine 125:979). The genes for HSP60 and HSP70 show 
little variation from other chlamydial species, as would be 
expected. The gene encoding a 76kDa antigen has been cloned 
25 from a single strain of C. pneumoniae. It has no significant 
similarity with other known chlamydial genes (Marrie (1993) 
Clinical Infectious Diseases. 18:501). 

Many antigens recognised by immune sera to 
C. pneumoniae are conserved across all chlamydiae, but 98kDa, 
30 76 kDa and 54 kDa proteins appear to be C. pneumoniae-specif ic 
(Campos et al. (1995) Investigation of Ophthalmology and Visual 
Science 36:1477; Marrie (1993) Clinical Infectious Diseases. 
18:501; Wiedmann-Al-Ahmad M, et al. Reactions of polyclonal and 
neutralizing anti-p54 monoclonal antibodies with an isolated. 
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species-specific 54-k:ilodalton protein of Chlamydia pneumoniae. 
Clin Diagn Lab Immunol. 1997 Nov; 4(6): 700-704), 
Immunoblotting of isolates with sera from patients does show 
variation of blotting patterns between isolates, indicating that 
5 serotypes C. pnexmoniae may exist (Grayston et al, (1995) 

Journal of Infectious Diseases 168:1231; Ramirez et al (1996) 
Annals of Internal Medicine 125:979). However, the results are 
potentially confounded by the infection status of the patients, 
since immunoblot profiles of a patient's sera change with time 

10 post-infection. An assessment of the number and relative 

frequency of any serotypes, and the defining antigens, is not 
yet possible. 

Accordingly, a need exists for identifying and 
isolating polynucleotide sequences of C. pneumoniae for use in 

15 preventing and treating Chlamydia infection. 

SUMMftRY OF THE INVENTION 

The present invention provides purified and isolated 
polynucleotide molecules that encode Chlamydia polypeptides 

20 which can be used in methods to prevent, treat, and diagnose 
Chlamydia infection. In one form of the invention, the 
polynucleotide molecules are selected from DNA that encode 
polypeptides CPN100634 (SEQ ID Nos: 1 and 2), CPN100635 (SEQ ID 
Nos: 3 and 4), CPN100638 (SEQ ID Nos: 5 and 6), CPN100639 (SEQ 

25 ID Nos: 7 and 8), and CPN100708 (SEQ ID Nos: 9 and 10). 

Another form of the invention provides polypeptides 
corresponding to the isolated DNA molecules. The amino acid 
sequences of the corresponding encoded polypeptides are shown 
for CPN100634 as SEQ ID No: 11, CPN100635 as SEQ ID Nos: 12 and 

30 13, CPN100638 as SEQ ID No: 14, CPN100639 as SEQ ID No: 15, and 
CPN100708 as SEQ ID No: 16. 

Those skilled in the art will readily understand that the 
invention, having provided the polynucleotide sequences encoding 
Chlamydia polypeptides, also provides polynucleotides encoding 
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fragments derived from such peptides. Moreover, the invention 
is understood to provide mutants and derivatives of such 
polypeptides and fragments derived therefrom, which result from 
the addition, deletion, or substitution of non-essential amino 
5 acids as described herein. Those skilled in the art would also 
readily understand that the invention, having provided the 
polynucleotide sequences encoding Chlamydia polypeptides, 
further provides monospecific antibodies that specifically bind 
to such polypeptides. 

10 The present invention has wide application and includes 

expression cassettes, vectors, and cells transformed or 
transfected with the polynucleotides of the invention. 
Accordingly, the present invention further provides (i) a method 
for producing a polypeptide of the invention in a recombinant 

15 host system and related expression cassettes, vectors, and 
transformed or transfected cells; (ii) a vaccine, or a live 
vaccine vector such as a pox virus. Salmonella typhimurium, or 
Vibrio cholerae vector, containing a polynucleotide of the 
invention, such vaccines and vaccine vectors being useful for, 

20 e.g., preventing and treating Chlamydia infection, in 
combination with a diluent or carrier, and related 
pharmaceutical compositions and associated therapeutic and/or 
prophylactic methods; (iii) a therapeutic and/or prophylactic 
use of an RNA or . DNA molecule of the invention, either in a 

25 naked form or formulated with a delivery vehicle, a polypeptide 
or combination of polypeptides, or a monospecific antibody of 
the invention, and related pharmaceutical compositions; (iv) a 
method for diagnosing the presence of Chlamydia in a biological 
sample, which can involve the use of a DNA or RNA molecule, a 

30 monospecific antibody, or a polypeptide of the invention; and 
(v) a method for purifying a polypeptide of the invention by 
antibody-based affinity chromatography. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be further understood from the 
following description with reference to the drawings, in which: 
Figure 1 shows the nucleotide sequence of the CPN100634 
5 (SEQ ID No: 1 - entire sequence and SEQ ID No: 2 - coding 

sequence) and the deduced amino acid sequence of the CPN100634 
protein from Chlamydia pneumoniae (SEQ ID No: 11). 

Figure 2 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100634 gene. 
10 Figure 3 shows the nucleotide sequence of the CPN100635 

{SEQ ID No: 3 - entire sequence and SEQ ID No: 4 - coding 
sequence) and the deduced amino acid sequence of the CPN100635 
protein from Chlamydia pneumoniae (SEQ ID No: 12 - entire amino 
acid sequence corresponding to the open reading frame, and 13 - 
15 the post-translationally processed polypeptide) . 

Figure 4 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100635 gene. 

Figure 5 shows the nucleotide sequence of the CPN100638 
(SEQ ID No: 5 - entire sequence and SEQ ID No: 6 - coding 
20 sequence) and the deduced amino acid sequence of the CPN100638 
protein from Chlamydia pneumoniae (SEQ ID No: 14). The sequence 
is encoded on the negative strand. 

Figure 6 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN1Q0638 gene. 
25 Figure 7 shows the nucleotide sequence of the CPN100639 

(SEQ ID No: 7 - entire sequence and SEQ ID No: 8 - coding 
sequence) and the deduced amino acid sequence of the CPN100639 
protein from Chlamydia pneumoniae (SEQ ID No: 15) . 

Figure 8 shows the restriction enzyme analysis of the 
30 gene encoding the C. pneumoniae CPN100639 gene. 

Figure 9 shows the nucleotide sequence of the CPN100708 
(SEQ ID No: 9 - entire sequence and SEQ ID No: 10 - coding 
sequence coded for on the negative strand) and the deduced amino 
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acid sequence of the CPN100708 protein from Chlamydia pneumoniae 
(SEQ ID No: 16) . 

Figure 10 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100708 gene. 
5 Figures 11 through 15 show an identification of T and B 

cell epitopes from the amino acid sequences shown in the 
foregoing figures. 

DETAIItED DESCRIPTION OF INVENTION 

10 Open reading frames (ORFs) encoding chlamydial 

polypeptides have been identified from the pneumoniae genome. 
These polypeptides include polypeptides found permanently in the 
bacterial membrane structure, polypeptides present in the 
external vicinity of the bacterial membrane, polypeptides found 

15 permanently in the inclusion membrane structure, polypeptides 
present in the external vicinity of the inclusion membrane, and 
polypeptides released into the cytoplasm of the infected cell. 
These polypeptides can be used to prevent and treat Chlamydia 
infection. 

20 According to a first aspect of the invention, isolated 

polynucleotides are provided which encode the precursor and 
mature forms of Chlamydia polypeptides, whose amino acid 
sequences are selected from the group consisting of: SEQ ID 
Nos: 11 to 16. 

25 The term "isolated polynucleotide" is defined as a 

polynucleotide removed from the environment in which it 
naturally occurs. For example, a naturally-occurring DNA 
molecule present in the genome of a living bacteria or as part 
of a gene bank is not isolated, but the same molecule separated 

30 from the remaining part of the bacterial genome, as a result of, 
e.g., a cloning event (amplification), is isolated. Typically, 
an isolated DNA molecule is free from DNA regions (e.g., coding 
regions) with which it is immediately contiguous at the 5' or 3' 
end, in the naturally occurring genome. Such isolated 
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polynucleotides may be part of a vector or a composition and 
still be defined as isolated in that such a vector or 
composition is not part of the natural environment of such 
polynucleotide . 

5 The polynucleotide of the invention is either RNA or DNA 

(cDNA, genomic DNA, or synthetic DNA) , or modifications, 
variants, homologs or fragments thereof. The DNA is either 
double-stranded or single-stranded, and, if single-stranded, is 
either the coding strand or the non-coding (anti-sense) strand. 
10 Any one of the sequences that encode the polypeptides of the 
invention as shown in SEQ ID Nos: 1 to 10 is (a) a coding 
sequence, (b) a ribonucleotide sequence derived from 
transcription of (a) / or (c) a coding sequence which uses the 
redundancy or degeneracy of the genetic code to encode the same 
15 polypeptides. By "polypeptide" or "protein" is meant any chain 
of amino acids, regardless of length or post-translational 
modification (e.g., glycosylation or phosphorylation). Both 
terms are used interchangeably in the present application. 

Consistent with the first aspect of the invention, amino 
20 acid sequences are provided which are homologous to any one of 
SEQ ID Nos: 11 to 16. As used herein, "homologous amino acid 
sequence" is any polypeptide which is encoded, in whole or in 
part, by a nucleic acid sequence which hybridizes at 25-35**C 
below critical melting temperature (Tm) , to any portion of the 
25 nucleic acid sequences of SEQ ID Nos: 1 to 10. A homologous 
amino acid sequence is one that differs from an amino acid 
sequence shown in any one of SEQ ID Nos: 11 to 16 by one or more 
conservative amino acid substitutions. Such a sequence also 
encompass serotypic variants (defined below) as well as 
30 sequences containing deletions or insertions which retain 
inherent characteristics of the polypeptide such as 
immunogenicity. Preferably, such a sequence is at least 75%, 
more preferably 80%, and most preferably 90% identical to any 
one of SEQ ID Nos: 11 to 16. 
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Homologous amino acid sequences include sequences that 
are identical or substantially identical to SEQ ID Nos: 11 to 
16. By "amino acid sequence substantially identical" is meant a 
sequence that is at least 90%, preferably 95%, more preferably 
5 97%, and most preferably 99% identical to an amino acid sequence 
of reference and that preferably differs from the sequence of 
reference by a majority of conservative amino acid 
substitutions . 

Conservative amino acid substitutions are substitutions 
10 among amino acids of the same class. These classes include, for 
example, amino acids having uncharged polar side chains, such as 
asparagine, glutamine, serine, threonine, and tyrosine; amino 
acids haying basic side chains, such as lysine, arginine, and 
histidine; amino acids having acidic side chains, such as 
15 aspartic acid and glutamic acid; and amino acids having nonpolar 
side chains, such as glycine, alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan, and 
cysteine. 

Homology is measured using sequence analysis software 
20 such as Sequence Analysis Software Package of the Genetics 
Computer Group, University of Wisconsin Biotechnology Center, 
1710 University Avenue, Madison, WI 53705. Amino acid sequences 
are aligned to maximize identity. Gaps may be artificially 
introduced into the sequence to attain proper alignment. Once 
25 the optimal alignment has been set up, the degree of homology is 
established by recording all of the positions in which the amino 
acids of both sequences are identical, relative to the total 
niamber of positions. 

Homologous polynucleotide sequences are defined in a 
30 similar way. Preferably, a homologous sequence is one that is 
at least 45%, more preferably 60%, and most preferably 85% 
identical to any one of coding sequences SEQ ID Nos: 1 to 10. 

Consistent with the first aspect of the invention, 
polypeptides having a sequence homologous to any one of SEQ ID 
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Nos: 11 to 16 include naturally-occurring allelic variants, as 
well as mutants or any other non-naturally occurring variants 
that retain the inherent characteristics of the polypeptide of 
SEQ ID Nos: 11 to 16. 
5 As is known in the art, an allelic variant is an 

alternate form of a polypeptide that is characterized as having 
a substitution, deletion, or addition of one or more amino acids 
that does not alter the biological function of the polypeptide. 
By "biological function" is meant the function of the 

10 polypeptide in the cells in which it naturally occurs, even if 
the function is not necessary for the growth or survival of the 
cells. For example, the biological function of a porin is to 
allow the entry into cells of compounds present in the 
extracellular medium- Biological function is distinct from 

15 antigenic property. A polypeptide can have more than one 
biological function . 

Allelic variants are very common in nature. For example, 
a bacterial species such as C. pneumoniae, is usually 
represented by a variety of strains that differ from each other 

20 by minor allelic variations. Indeed, a polypeptide that 

fulfills the same biological function in different strains can 
have an amino acid sequence (and polynucleotide sequence) that 
is not identical in each of the strains. Despite this 
variation, an immune response directed generally against many 

25 allelic variants has been demonstrated. In studies of the 
Chlamydial MOMP antigen, cross-strain antibody binding plus 
neutralization of infectivity occurs despite amino acid sequence 
variation of MOMP from strain to strain, indicating that the 
MOMP, when used as an immunogen, is tolerant of amino acid 

30 variations. 

Polynucleotides encoding homologous polypeptides or 
allelic variants are retrieved by polymerase chain reaction 
(PGR) amplification of genomic bacterial DNA extracted by 
conventional methods. This involves the use of synthetic 
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oligonucleotide primers matching upstream and downstream of the 
5' and 3' ends of the encoding domain. Suitable primers are 
designed according to the nucleotide sequence information 
provided in SEQ ID Nos: 1 to 10. The procedure is as follows: a 
5 primer is selected which consists of 10 to 40, preferably 15 to 
25 nucleotides. It is advantageous to select primers containing 
C and G nucleotides in a proportion sufficient to ensure 
efficient hybridization; i.e., an amount of C and G nucleotides 
of at least 40%, preferably 50% of the total nucleotide 

10 content. A standard PGR reaction contains typically 0.5 to 5 
Units of Taq DNA polymerase per 100 ^iL, 20 to 200 fiM 
deoxynucleotide each, preferably at equivalent concentrations, 
0.5 to 2.5 MM magnesium over the total deoxynucleotide 
concentration, 10^ to 10^ target molecules, and about 20 pmol of 

15 each primer. About 25 to 50 PGR cycles are performed, with an 
annealing temperature 15*'G to 5*'C below the true Tm of the 
primers. A more stringent annealing temperature improves 
discrimination against incorrectly annealed primers and reduces 
incorportion of incorrect nucleotides at the 3' end of primers. 

20 A denaturation temperature of 95*'G to 97**G is typical, although 
higher temperatures may be appropriate for dematuration of G+G- 
rich targets. The number of cycles performed depends on the 
starting concentration of target molecules, though typically 
more than 4 0 cycles is not recommended as non-specific 

25 background products tend to accumulate. 

An alternative method for retrieving polynucleotides 
encoding homologous polypeptides or allelic variants is by 
hybridization screening of a DNA or RNA library. Hybridization 
procedures are well-known in the art and are described in 

30 Ausubel et ai., {Ausubel et aJ., Current Protocols in Molecular 
Biology, John Wiley & Sons Inc., 1994), Silhavy et al. (Silhavy 
et al. Experiments with Gene Fusions, Cold Spring Harbor 
Laboratory Press, 1984), and Davis et al. (Davis et al. A Manual 
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for Genetic Engineering: Advanced Bacterial Genetics, Cold 
Spring Harbor Laboratory Press, 1980)). Important parameters 
for optimizing hybridization conditions are reflected in a 
formula used to obtain the critical melting temperature above 
5 which two complementary DNA strands separate from each other 
(Casey & Davidson, Nucl. Acid Res. (1977) 4:1539). For 
polynucleotides of about 600 nucleotides or larger, this formula 
is as follows: Tm = 81.5 + 0.5 x (% G+C) +1.6 log (positive ion 
concentration) - 0,6 x (% formamide) . Under appropriate 

10 stringency conditions, hybridization temperature (Th) is 

approximately 20 to 40**C, 20 to 25'*C, or, preferably 30 to 40*^0 
below the calculated Tm. Those skilled in the art will 
understand that optimal temperature and salt conditions can be 
readily determined. 

15 For the polynucleotides of the invention, stringent 

conditions are achieved for both pre-hybridizing and hybridizing 
incubations (i) within 4-16 hours at 42**C, in 6 x SSC containing 
50% formamide, or (ii) within 4-16 hours at 65°C in an aqueous 
6 X SSC solution (1 M NaCl, 0.1 M sodium citrate (pH 7.0)). 

20 Useful homologs and fragments thereof that do not occur 

naturally are designed using known methods for identifying 
regions of an antigen that are likely to tolerate amino acid 
sequence changes and/or deletions. As an example, homologous 
polypeptides from different species are compared; conserved 

25 sequences are identified. The more divergent sequences are the 
most likely to tolerate sequence changes. Homology among 
sequences may be analyzed using the BLAST homology searching 
algorithm of Altschul et al.. Nucleic Acids Res . 25 : 3389-3402 
(1997). Alternatively, sequences are modified such that they 

30 become more reactive to T- and/or B-cells . (See Figures 11 to 
15 below for identification of T- and B- epitopes) . Yet another 
alternative is to mutate a particular amino acid residue or 
sequence within the polypeptide in vitro, then screen the mutant 
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polypeptides for their ability to prevent or treat Chlamydia 
infection according to the method outlined below. 

A person skilled in the art will readily understand that 
by following the screening process of this invention, it will be 
5 determined without undue experimentation whether a particular 
homolog of any of SEQ ID Nos: 11 to 16 may be useful in the 
prevention or treatment of Chlamydia infection. The screening 
procedure comprises the steps: 

(i) immunizing an animal, preferably mouse, with the 
10 test homolog or fragment; 

(ii) inoculating the immunized animal with Chlamydia; 
and, 

(iii) selecting those homologs or fragments which confer 
protection against Chlamydia. 

15 By conferring protection" is meant that there is a 

reduction is severity of any of the effects of Chlamydia 
infection, in comparison with a control animal which was not 
immunized with the test homolog or fragment. 

It has been previously demonstrated (Yang, Z. P., Chi, E. 

20 Y., Kuo, C. C. and Grayston, J. T. 1993. A mouse model of C. 
pneumoniae strain TWAR pneumonitis. 61 (5) : 2037-2040) that mice 
are susceptible to intranasal infection with different isolates 
of C. pneumoniae. Strain AR-39 (Chi, E. Y., Kuo, C. C. and 
Grayston, J. T. , 1987. Unique ultrastructure in the elementary 

25 body of Chlamydia sp. strain TWAR. J. Bacterid. 169 (8) : 3757-63) 
was used in Balb/c mice as a challenge infection model to 
examine the capacity of chlamydia gene products delivered as 
naked DNA to elicit a protective response against a sublethal 
C. pneumoniae lung infection. Protective immunity is defined as 

30 an accelerated clearance of pulmonary infection. 

Groups of 7 to 9 week old male Balb/c mice (6 to 10 per 
group) were immunized intramuscularly (i.m.) plus intranasally 
(i,n.) with plasmid DNA containing the coding sequence of a 
C- pneumoniae polypeptide. Saline or the plasmid vector lacking 
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an inserted chlamydial gene was given to groups of control 
animals . 

For i.m. immunization alternate left and right quadriceps 
were injected with lOO^ig of DNA in 50^1 of PBS on three 

5 occasions at 0, 3 and 6 weeks. For i.n. immunization, 

anaesthetized mice aspirated SOjil of PBS containing 50 \ig DNA on 
three occasions at 0, 3 and 6 weeks. At week 8, immunized mice 
were inoculated i.n. with 5 x 10^ IFU of C. pneumoniae , strain 
AR39 in lOOjil of SPG buffer to test their ability to limit the 

10 growth of a sublethal C. pneumoniae challenge. 

Lungs were taken from mice at day 9 post-challenge and 
immediately homogenised in SPG buffer (7.5% sucrose, 5mM 
glutamate, 12.5mM phosphate pH7.5). The homogenate was stored 
frozen at -70**C until assay. Dilutions of the homogenate were 

15 assayed for the preisence of infectious chlamydia by inoculation 
onto monolayers of susceptible cells. The inoculum was 
centrifuged onto the cells at 3000rpm for 1 hour, then the cells 
were incubated for three days at 35°C in the presence of l|ig/ml 
cycloheximide. After incubation the monolayers were fixed with 

20 formalin and methanol then immunoperoxidase stained for the 
presence of chlamydial inclusions using convalescent sera from 
rabbits infected with C. pneumoniae and metal-enhanced DAB as a 
peroxidase substrate. 

Consistent with the first aspect of the invention, 

25 polypeptide derivatives are provided that are partial sequences 
of SEQ ID Nos: 11 to 16, partial sequences of polypeptide 
sequences homologous to SEQ ID Nos: 11 to 16, polypeptides 
derived from full-length polypeptides by internal deletion, and 
fusion proteins. 

30 It is an accepted practice in the field of immunology to 

use fragments and variants of protein immunogens as vaccines, as 
all that is required to induce an immune response to a protein 
is a small (e.g., B to 10 amino acid) immunogenic region of the 
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protein. Various short synthetic peptides corresponding to 
surface-exposed antigens of pathogens other than Chlamydia have 
been shown to be effective vaccine antigens against their 
respective pathogens, e.g. an 11 residue peptide of murine 
5 mammary tumor virus (Casey & Davidson, Nucl. Acid Res. (1977) 
4:1539), a 15-residue peptide ofSemliki Forest virus (Snijders 
et al.r 1991. J. Gen. Virol. 72:557-565), and two overlapping 
peptides of 15 residues each from canine parvovirus (Langeveld 
et al., Vaccine 12 (15) : 1473-1480, 1994). 

10 Accordingly, it will be readily apparent to one skilled 

in the art, having read the present description, that partial 
sequences of SEQ ID Nos: 11 to 16 or their homologous amino acid 
sequences are inherent to the full-length sequences and are 
taught by the present invention. Such polypeptide fragments 

15 preferably are at least 12 amino acids in length. 

Advantageously, they are at least 20 amino acids, preferably at 
least 50 amino acids, more preferably at least 75 amino acids, 
and most preferably at least 100 amino acids in length. 

Polynucleotides of 30 to 600 nucleotides encoding partial 

20 sequences of sequences homologous to SEQ ID Nos: 11 to 16 are 
retrieved by PGR amplification using the parameters outlined 
above and using primers matching the sequences upstream and 
downstream of the 5' and 3' ends of the fragment to be 
amplified. The template polynucleotide for such amplification 

25 is either the full length polynucleotide homologous to one of 
SEQ ID Nos: 1 to 10, or a polynucleotide contained in a mixture 
of polynucleotides such as a DNA or RNA library. As an 
alternative method for retrieving the partial sequences, 
screening hybridization is carried out under conditions 

30 described above and using the formula for calculating Tm. Where 
fragments of 30 to 600 nucleotides are to be retrieved, the 
calculated Tm is corrected by subtracting (600/polynucleotide 
size in base pairs) and the stringency conditions are defined by 
a hybridization temperature that is 5 to 10 °C below Tm. Where 
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oligonucleotides shorter than 20-30 bases are to be obtained, 
the formula for calculating the Tm is as follows: Tm = 4 x (G+C) 
+ 2 (A+T) . For example, an 18 nucleotide fragment of 50% G+C 
would have an approximate Tm of 54 °C. Short peptides that are 
5 fragments of SEQ. ID Nos. 11 to 16 or their homologous 

sequences, are obtained directly by chemical synthesis (E. Gross 
and H. J. Meinhofer, 4 The Peptides: Analysis, Synthesis, 
Biology; Modern Techniques of Peptide Synthesis, John Wiley & 
Sons (1981), and M. Bodanzki, Principles of Peptide Synthesis, 
10 Springer -Verlag (1984)). 

Useful polypeptide derivatives, e.g., polypeptide 
fragments, are designed using computer-assisted analysis of 
amino acid sequences. This identifies probable surface-exposed, 
antigenic regions (Hughes et ai., 1992. Infect. Immun. 
15 60(9):3497). An analysis of the 6 amino acid sequences 
contained in SEQ ID Nos: 11 to 16, based on the product of 
flexibility and hydrophobicity propensities using the program 
SEQSEE (Wishart DS, et al. "SEQSEE: a comprehensive program 
suite for protein sequence analysis." Comput Appl Biosci. 1994 
20 Apr;10 (2) :121-32) , reveal a number of potential B- and T-cell 
epitopes which may be used as a basis for selecting useful 
immunogenic fragments and variants. The results are shown in 
Figures 11 to 15. This analysis uses a reasonable combination 
of external surface features that is likely to be recognized by 
25 antibodies. Probable T-cell epitopes for HIiA-A0201 MHC subclass 
were revealed by an algorithm written at Connaught Laboratories 
that emulates an approach developed at the NIH (Parker KC, et 
ai. Peptide binding to MHC class I molecules: implications for 
antigenic peptide prediction." Immunol Res 1995; 14 (1) : 34-57) . 
30 Epitopes which induce a protective T cell-dependent 

immune response are present throughout the length of the 
polypeptide. However, some epitopes may be masked by secondary 
and tertiary structures of the polypeptide. To reveal such 
masked epitopes large internal deletions are created which 
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remove much of the original protein structure and exposes the 
masked epitopes. Such internal deletions sometimes effects the 
additional advantage of removing immunodominant regions of high 
variability among strains. Polynucleotides encoding polypeptide 
5 fragments and polypeptides having large internal deletions are 
constructed using standard methods (Ausubel et al., Current 
Protocols in Molecular Biology, John Wiley & Sons Inc., 1994). 
Such methods include standard PGR, inverse PGR, restriction 
enzyme treatment of cloned DNA molecules, or the method of 
10 Kunkel et al. (Kunkel et al, Proc. Natl. Acad. Sci. USA (1985) 
82:448). Components for these methods and instructions for 
their use are readily available from various commercial sources 
such as Stratagene. Once the deletion mutants have been 
constructed, they are tested for their ability to prevent or 
15 treat Chlamydia infection as described above. 

As used herein, a fusion polypeptide is one that contains 
a polypeptide or a polypeptide derivative of the invention fused 
at the N- or G-terminal end to any other polypeptide 
(hereinafter referred to as a peptide tail) . A simple way to 
20 obtain such a fusion polypeptide is by translation of an in- 
frame fusion of the polynucleotide sequences, i.e., a hybrid 
gene. The hybrid gene encoding the fusion polypeptide is 
inserted into an expression vector which is used to transform or 
transfect a host cell. Alternatively, the polynucleotide 
25 sequence encoding the polypeptide or polypeptide derivative is 
inserted into an expression vector in which the polynucleotide 
encoding the peptide tail is already present. Such vectors and 
instructions for their use are commercially available, e.g. the 
pMal-c2 or pMal-p2 system from New England Biolabs, in which the 
30 peptide tail is a maltose binding protein, the glutathione-S- 
transferase system of Pharmacia, or the His-Tag system available 
from Novagen. These and other expression systems provide 
convenient means for further purification of polypeptides and 
derivatives of the invention. 
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An advantageous example of a fusion polypeptide is one 
where the polypeptide or homolog or fragment of the invention is 
fused to a polypeptide having adjuvant activity, such as subunit 
B of either cholera toxin or E. coli heat-labile toxin. Another 
5 advantageous fusion is one where the polypeptide, homolog or 
fragment is fused to a strong T-cell epitope or B-cell epitope. 
Such an epitope may be one known in the art (e.g. the Hepatitis 
B virus core antigen, D.R. Millich et al., Antibody production 
to the nucleocapsid and envelope of the Hepatitis B virus primed 

10 by a single synthetic T cell site". Nature. 1987. 329:547-549), 
or one which has been identified in another polypeptide of the 
invention (Figures 11-15) . Consistent with this aspect of the 
invention is a fusion polypeptide comprising T- or B-cell 
epitopes from one of SEQ ID Nos: 11 to 16 or its homolog or 

15 fragment, wherein the epitopes are derived from multiple 
variants of said polypeptide or homolog or fragment, each 
variant differing from another in the location and sequence of 
its epitope within the polypeptide. Such a fusion is effective 
in the prevention and treatment of Chlamydia infection since it 

20 optimizes the T- and B-cell response to the overall polypeptide, 
homolog or fragment. 

To effect fusion, the polypeptide of the invention is 
fused to the N-, or preferably, to the C-tejrminal end of the 
polypeptide having adjuvant activity or T- or B-cell epitope. 

25 Alternatively, a polypeptide fragment of the invention is 
inserted internally within the amino acid sequence of the 
polypeptide having adjuvant activity. The T- or B-cell epitope 
may also be inserted internally within the amino acid sequence 
of the polypeptide of the invention. 

30 Consistent with the first aspect, the polynucleotides of 

the invention also encode hybrid precursor polypeptides 
containing heterologous signal peptides, which mature into 
polypeptides of the invention. By "heterologous signal peptide" 
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is meant a signal peptide that is not found in naturally- 
occurring precursors of polypeptides of the invention. 

A polynucleotide molecule according to the invention, 
including RNA, DNA, or modifications or combinations thereof, 
5 have various applications. A DNA molecule is used, for example, 
(i) in a process for producing the encoded polypeptide in a 
recombinant host system, (ii) in the construction of vaccine 
vectors such as poxviruses, which are further used in methods 
and compositions for preventing and/or treating Chlamydia 
10 infection, (iii) as a vaccine agent (as well as an RNA 

molecule) , in a naked form or formulated with a delivery vehicle 
and, (iv) in the construction of attenuated Chlamydia strains 
that can over-express a polynucleotide of the invention or 
express it in a non-toxic, mutated form. 
15 Accordingly, a second aspect of the invention encompasses 

(i) an expression cassette containing a DNA molecule of the 
invention placed under the control of the elements required for 
expression, in particular under the control of an appropriate 
promoter; (ii) an expression vector containing an expression 
20 cassette of the invention; (iii) a procaryotic or eucaryotic 
cell transformed or transfected with an expression cassette 
and/or vector of the invention, as well as (iv) a process for 
producing a polypeptide or polypeptide derivative encoded by a 
polynucleotide of the invention, which involves culturing a 
25 procaryotic or eucaryotic cell transformed or transfected with 
an expression cassette and/or vector of the invention, under 
conditions that allow expression of the DNA molecule of the 
invention and, recovering the encoded polypeptide or polypeptide 
derivative from the cell culture. 
30 A recombinant expression system is selected from 

procaryotic and eucaryotic hosts. Eucaryotic hosts include 
yeast cells (e.g., Saccharomyces cerevisiae or Pichia pastoris) , 
mammalian cells (e.g., COSl, NIH3T3, or JEG3 cells), arthropods 
cells (e.g., Spodoptera frugiperda (SF9) cells), and plant 
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cells. A preferred expression system is a procaryotic host such 
as E. coll. Bacterial and eucaryotic cells are available from a 
number of different sources including commercial sources to 
those skilled in the art, e.g., the American Type Culture 
5 Collection (ATCC; Rockville, Maryland). Commercial sources of 
cells used for recombinant protein expression also provide 
instructions for usage of the cells. 

The choice of the expression system depends on the 
features desired for the expressed polypeptide. For example, it 
10 may be useful to produce a polypeptide of the invention in a 
particular lipidated form or any other form. 

One skilled in the art would readily understand that not 
all vectors and expression control sequences and hosts would be 
expected to express equally well the polynucleotides of this 
15 invention. With the guidelines described below, however, a 

selection of vectors, expression control sequences and hosts may 
be made without undue experimentation and without departing from 
the scope of this invention. 

In selecting a vector, the host must be chosen that is 
20 compatible with the vector which is to exist and possibly 

replicate in it. Considerations are made with respect to the 
vector copy number, the ability to control the copy number, 
expression of other proteins such as antibiotic resistance. In 
selecting an expression control sequence, a number of variables 
25 are considered. Among the important variable are the relative 
strength of the sequence (e.g. the ability to drive expression 
under various conditions), the ability to control the sequence's 
function, compatibility between the polynucleotide to be 
expressed and the control sequence (e.g. secondary structures 
30 are considered to avoid hairpin structures which prevent 

efficient transcription) . In selecting the host, unicellular 
hosts are selected which are compatible with the selected 
vector, tolerant of any possible toxic effects of the expressed 
product, able to secrete the expressed product efficiently if 
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such is desired, to be able to express the product in the 
desired conformation, to be easily scaled up, and to which ease 
of purification of the final product. 

The choice of the expression cassette depends on the host 
5 system selected as well as the features desired for the 
expressed polypeptide. Typically, an expression cassette 
includes a promoter that is functional in the selected host 
system and can be constitutive or inducible; a ribosome binding 
site; a start codon (ATG) if necessary; a region encoding a 
10 signal peptide, e.g., a lipidation signal peptide; a DNA 

molecule of the invention; a stop codon; and optionally a 3' 
terminal region (translation and/or transcription terminator) . 
The signal peptide encoding region is adjacent to the 
polynucleotide of the invention and placed in proper reading 
15 frame. The signal peptide-encoding region is homologous or 

heterologous to the DNA molecule encoding the mature polypeptide 
and is compatible with the secretion apparatus of the host used 
for expression. The open reading frame constituted by the DNA 
molecule of the invention, solely or together with the signal 
20 peptide, is placed under the control of the promoter so that 
transcription and translation occur in the host system. 
Promoters and signal peptide encoding regions are widely known 
and available to those skilled in the art and include, for 
example, the promoter of Salmonella typhimurium (and 
25 derivatives) that is inducible by arabinose (promoter araB) and 
is functional in Gram-negative bacteria such as E. coli (as 
described in U.S. Patent No. 5,028,530 and in Cagnon et al., 
(Cagnon et al.. Protein Engineering (1991) 4(7):843)); the 
promoter of the gene of bacteriophage T7 encoding RNA 
30 polymerase, that is functional in a number of E. coil strains 
expressing T7 polymerase (described in U.S. Patent 
No. 4,952,496); OspA lipidation signal peptide ; and RlpB 
lipidation signal peptide (Takase et al., J. Bact, (1987) 
169:5692). 
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The expression cassette is typically part of an 
expression vector, which is selected for its ability to 
replicate in the chosen expression system. Expression vectors 
{e.g., plasmids or viral vectors) can be chosen, for example, 
5 from those described in Pouwels et ai. (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987) , Suitable expression 
vectors can be purchased from various commercial sources. 

Methods for transf orming/transf ecting host cells with 
expression vectors are well-known in the art and depend on the 
10 host system selected as described in Ausubel et al., (Ausubel et 
al.. Current Protocols in Molecular Biology, John Wiley & Sons 
Inc., 1994). 

Upon expression, a recombinant polypeptide of the 
invention (or a polypeptide derivative) is produced and remains 

15 in the intracellular compartment, is secreted/excreted in the 
extracellular medium or in the periplasmic space, or is embedded 
in the cellular membrane. The polypeptide is recovered in a 
substantially purified form from the cell extract or from the 
supernatant after centrifugation of the recombinant cell 

20 culture- Typically, the recombinant polypeptide is purified by 
antibody-based affinity purification or by other well-known 
methods that can be readily adapted by a person skilled in the 
art, such as fusion of the polynucleotide encoding the 
polypeptide or its derivative to a small affinity binding 

25 domain. Antibodies useful for purifying by immunoaf f inity the 
polypeptides of the invention are obtained as described below. 

A polynucleotide of the invention can also be useful as a 
vaccine- There are two major routes, either using a viral or 
bacterial host as gene delivery vehicle (live vaccine vector) or 

30 administering the gene in a free form, e.g-, inserted into a 
plasmid. Therapeutic or prophylactic efficacy of a 
polynucleotide of the invention is evaluated as described below. 

Accordingly, a third aspect of the invention provides (i) 
a vaccine vector such as a poxvirus, containing a DNA molecule 
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of the invention, placed under the control of elements required 
for expression; (ii) a composition of matter comprising a 
vaccine vector of the invention, together with a diluent or 
carrier; specifically (iii) a pharmaceutical composition 
5 containing a therapeutically or prophylactically effective 
amount of a vaccine vector of the invention; (iv) a method for 
inducing an immune response against Chlamydia in a mammal (e.g., 
a human; alternatively, the method can be used in veterinary 
applications for treating or preventing Chlamydia infection of 
10 animals, e.g., cats or birds), which involves administering to 
the mammal an immunogenically effective amount of a vaccine 
vector of the invention to elicit a protective or therapeutic 
immune response to Chlamydia ; and particularly, (v) a method 
for preventing and/or treating a Chlamydia (e.g.,. 
15 C. trachomatis^ C. psittaci, C. pneumonia^ C. pecorum) 
infection^ which involves administering a prophylactic or 
therapeutic amount of a vaccine vector of the invention to an 
infected individual. Additionally, the third aspect of the 
invention encompasses the use of a vaccine vector of the 
20 invention in the preparation of a medicament for preventing 
and/or treating Chlamydia infection. 

As used herein, a vaccine vector expresses one or several 
polypeptides or derivatives of the invention. The vaccine 
vector may express additionally a cytokine, such as interleukin- 
25 2 {IL-2) or interleukin-12 {IL-12), that enhances the immune 
response (adjuvant effect) . It is understood that each of the 
components to be expressed is placed under the control of 
elements required for expression in a mammalian cell. 

Consistent with the third aspect of the invention is a 
30 composition comprising several vaccine vectors, each of them 
capable of expressing a polypeptide or derivative of the 
invention. A composition may also comprise a vaccine vector 
capable of expressing an additional Chlamydia antigen, or a 
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subunit, fragment, homolog, mutant, or derivative thereof, 
optionally together with a cytokine such as IL-2 or IL-12. 

Vaccination methods for treating or preventing infection 
in a mammal comprises use of a vaccine vector of the invention 
5 to be administered by any conventional route, particularly to a 
mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, 
intestinal, rectal, vaginal, or urinary tract) surface or via 
the parenteral (e.g., subcutaneous, intradermal, intramuscular, 
intravenous, or intraperitoneal) route. Preferred routes depend 
10 upon the choice of the vaccine vector. Treatment may be 
effected in a single dose or repeated at intervals. The 
appropriate dosage depends on various parameters understood by 
skilled artisans such as the vaccine vector itself, the route of 
administration or the condition of the mammal to be vaccinated 
15 (weight, age and the like) . 

Live vaccine vectors available in the art include viral 
vectors such as adenoviruses and poxviruses as well as bacterial 
vectors, e.g.. Shigella ^ Salmonella, Vibrio choleras, 
Lactobacillus , Bacille bili6 de Calmette-Guerin (BCG) , and 
20 Streptococcus. 

An example of an adenovirus vector, as well as a method 
for constructing an adenovirus vector capable of expressing a 
DNA molecule of the invention, are described in U.S. Patent No. 
4,920,209. Poxvirus vectors include vaccinia and canary pox 
25 virus, described in U.S. Patent No. 4,722,848 and U.S. Patent 
No. 5,364,773, respectively. (Also see, e.g., Tartaglia et al.. 
Virology (1992) 188:217) for a description of a vaccinia virus 
vector and Taylor et al. Vaccine (1995) 13:539 for a reference 
of a canary pox.) Poxvirus vectors capable of expressing a 
30 polynucleotide of the invention are obtained by homologous 
recombination as described in Kieny et al.. Nature (1984) 
312:163 so that the polynucleotide of the invention is inserted 
in the viral genome under appropriate conditions for expression 
in mammalian cells. Generally, the dose of vaccine viral 
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vector, for therapeutic or prophylactic use, can be of from 
about 1x10^ to about 1x10^^, advantageously from about 1x10^ to 
about 1x10^°, preferably of from about 1x10^ to about 1x10^ 
plaque-forming units per kilogram. Preferably, viral vectors 
5 are administered parenterally; for example, in 3 doses, 4 weeks 
apart. It is preferable to avoid adding a chemical adjuvant to 
a composition containing a viral vector of the invention and 
thereby minimizing the immune response to the viral vector 
itself. 

10 Non-toxicogenic Vibrio cholerae mutant strains that are 

useful as a live oral vaccine are known. Mekalanos et al. , 
Nature (1983) 306:551 and U.S. Patent No. 4,882,278 describe 
strains which have a substantial amount of the coding sequence 
of each of the two ctxA alleles deleted so that no functional 

15 cholerae toxin is produced. WO 92/11354 describes a strain in 
which the irgA locus is inactivated by mutation; this mutation 
can be combined in a single strain with ctxA mutations. WO 
94/01533 describes a deletion mutant lacking functional ctxA and 
attRSl DNA sequences. These mutant strains are genetically 

20 engineered to express heterologous antigens, as described in 
WO 94/19482. An effective vaccine dose of a Vibrio cholerae 
strain capable of expressing a polypeptide or polypeptide 
derivative encoded by a DNA molecule of the invention contains 
about 1x10^ to about lxlO^ preferably about 1x10^ to about 1x10®, 

25 viable bacteria in a volume appropriate for the selected route 
of administration. Preferred routes of administration include 
all mucosal routes; most preferably, these vectors are 
administered intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically 

30 engineered for recombinant expression of heterologous antigens 
or not, and their use as oral vaccines are described in 
Nakayama et al. (Bio/Technology (1988) 6:693) and WO 92/11361. 
Preferred routes of administration include all mucosal routes; 
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most preferably, these vectors are administered intranasally or 
orally. 

Other bacterial strains used as vaccine vectors in the 
context of the present invention are described for. Shigella 
5 flexneri in High et al., EMBO (1992) 11:1991 and Sizemore et 
ai.. Science (1995) 270:299; for Streptococcus gordonii in 
Medaglini et ai., Proc. Natl. Acad. Sci. USA (1995) 92:6868; and 
for Bacille Calmette Guerin in Flynn Cell. Mol. Biol. 

(1994) 40 (suppl. I):31, WO 88/06626, WO 90/00594, WO 91/13157, 
10 WO 92/01796, and WO 92/21376. 

In bacterial vectors, the polynucleotide of the invention 
is inserted into the bacterial genome or remains in a free state 
as part of a plasmid. 

The composition comprising a vaccine bacterial vector of 
15 the present invention may further contain an adjuvant . A 
niamber of adjuvants are known to those skilled in the art. 
Preferred adjuvants as provided below. 

Accordingly, a fourth aspect of the invention provides 
(i) a composition of matter comprising a polynucleotide of the 
20 invention, together with a diluent or carrier; (ii) a 

pharmaceutical composition comprising a therapeutically or 
prophylactically effective amount of a polynucleotide of the 
invention; (iii) a method for inducing an immune response 
against Chlamydia in a mammal by administration of an 
25 immunogenically effective amount of a polynucleotide of the 
invention to elicit a protective immune response to Chlamydia; 
and particularly, (iv) a method for preventing and/or treating a 
Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae, or 
C- pecorum) infection, by administering a prophylactic or 
30 therapeutic amount of a polynucleotide of the invention to an 
infected individual. Additionally, the fourth aspect of the 
invention encompasses the use of a polynucleotide of the 
invention in the preparation of a medicament for preventing 
and/or treating Chlamydia infection. A preferred use includes 
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the use of a DNA molecule placed under conditions for expression 
in a mammalian cell, especially in a plasmid that is unable to 
replicate in mammalian cells and to substantially integrate in a 
mammalian genome. 
5 Use of the polynucleotides of the invention include their 

administration to a mammal as a vaccine, for therapeutic or 
prophylactic purposes. Such polynucleotides are used in the 
form of DNA as part of a plasmid that is unable to replicate in 
a mammalian cell and unable to integrate into the mammalian 
10 genome- Typically, such a DNA molecule is placed under the 
control of a promoter suitable for expression in a mairanalian 
cell. The promoter functions either ubiquitously or tissue- 
specifically. Examples of non-tissue specific promoters include 
the early Cytomegalovirus (CMV) promoter (described in U.S. 
15 Patent No. 4,168,062) and the Rous Sarcoma Virus promoter 

(described in Norton & Coffin, Molec. Cell Biol. (1985) 5:281). 
An example of a tissue-specific promoter is the desmin promoter 
which drives expression in muscle cells (Li et al.. Gene (1989) 
78:243, Li & Paulin, J. Biol. Chem, (1991) 266:6562 and Li & 
20 Paulin, J. Biol. Chem. (1993) 268:10403). Use of promoters is 
well-known to those skilled in the art. Useful vectors are 
described in numerous publications, specifically WO 94/21797 and 
Hartikka et ai.. Human Gene Therapy (1996) 7:1205. 

Polynucleotides of the invention which are used as a 
25 vaccine encode either a precursor or a mature form of the 

corresponding polypeptide. In the precursor form, the signal 
peptide is either homologous or heterologous- In the latter 
case, a eucaryotic leader sequence such as the leader sequence 
of the tissue-type plasminogen factor (tPA) is preferred. 
30 As used herein, a composition of the invention contains 

one or several polynucleotides with optionally at least one 
additional polynucleotide encoding another Chlamydia antigen 
such as urease subunit A, B, or both, or a fragment, derivative, 
mutant, or analog thereof. The composition may also contain an 
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additional polynucleotide encoding a cytokine, such as 
interleukin-2 (IL-2) or interleukin-12 (IL-12) so that the 
immune response is enhanced. These additional polynucleotides 
are placed under appropriate control for expression. 
5 Advantageously, DNA molecules of the invention and/or additional 
DNA molecules to be included in the same composition, are 
present in the same plasmid. 

Standard techniques of molecular biology for preparing 
and purifying polynucleotides are used in the preparation of 

10 polynucleotide therapeutics of the invention. For use as a 
vaccine, a polynucleotide of the invention is formulated 
according to various methods outlined below. 

One method utilizes the polynucleotide in a naked form, 
free of any delivery vehicles. Such a polynucleotide is simply 

15 diluted in a physiologically acceptable solution such as sterile 
saline or sterile buffered saline, with or without a carrier. 
When present, the carrier preferably is isotonic, hypotonic, or 
weakly hypertonic, and has a relatively low ionic strength, such 
as provided by a sucrose solution, e.g., a solution containing 

20 20% sucrose. 

An alternative method utilizes the polynucleotide in 
association with agents that assist in cellular uptake. 
Examples of such agents are (i) chemicals that modify cellular 
permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) 

25 liposomes for encapsulation of the polynucleotide, or 
(iii) cationic lipids or silica, gold, or tungsten 
microparticles which associate themselves with the 
polynucleotides. 

Anionic and neutral liposomes are well-known in the art 

30 (see, e.g.. Liposomes: A Practical Approach, RPC New Ed, IRL 
press (1990), for a detailed description of methods for making 
liposomes) and are useful for delivering a large range of 
products, including polynucleotides. Cationic lipids are also 
known in the art and are commonly used for gene delivery. Such 
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lipids include Lipofectin™ also known as DOTMA (N-[l-(2,3- 
dioleyloxy) propyl ] -N,N,N-trimethylaniinonium chloride) , DOTAP 
(1, 2-bis (oleyloxy) -3- (trimethylammonio) propane) , DDAB 
{dimethyldioctadecylammonium bromide), DOGS 
5 (dioctadecylamidologlycyl spermine) and cholesterol derivatives 
such as DC-Chol (3 beta- (N- (N' ,N' -dimethyl aminomethane) - 
carbamoyl) cholesterol). A description of these cat ionic lipids 
can be found in EP 187,702, WO 90/11092, U.S. Patent 
No. 5,283,185, WO 91/15501, WO 95/26356, and U.S. Patent 
10 No. 5,527,928. Cationic lipids for gene delivery are preferably 
used in association with a neutral lipid such as DOPE (dioleyl 
phosphatidylethanolamine) , as described in WO 90/11092 as an 
example . 

Formulation containing cationic liposomes may optionally 

15 contain other transf ection-f acilitating compounds. A number of 
them are described in WO 93/18759, WO 93/19768, WO 94/25608, and 
WO 95/02397. They include spermine derivatives useful for 
facilitating the transport of DNA through the nuclear membrane 
(see, for example, WO 93/18759) and membrane-permeabilizing 

20 compounds such as GALA, Gramicidine S, and cationic bile salts 
(see, for example, WO 93/19768). 

Gold or tungsten microparticles are used for gene 
delivery, as described in WO 91/00359, WO 93/17706, and Tang et 
al. Nature (1992) 356:152. The microparticle-coated 

25 polynucleotide is injected via intradermal or intraepidermal 

routes using a needleless injection device ("gene gun"), such as 
those described in U.S. Patent No. 4,945,050, U.S. Patent 
No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient 

30 depends, e.g. , on the strength of the promoter used in the DNA 
construct, the immunogenicity of the expressed gene product, the 
condition of the mammal intended for administration (e.g. , the 
weight, age, and general health of the mammal) , the mode of 
administration, and the type of formulation. In general, a 
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therapeutically or prophylactically effective dose from about 1 
pg to about 1 mg, preferably, from about 10 pg to about 800 ]ig 
and, more preferably, from about 25 pg to about 250 pg, can be 
administered to human adults. The administration can be 
5 achieved in a single dose or repeated at intervals. 

. The route of administration is any conventional route 
used in the vaccine field. As general guidance, a 
polynucleotide of the invention is administered via a mucosal 
surface, e.g., an ocular, intranasal, pulmonary, oral, 
10 intestinal, rectal, vaginal, and urinary tract surface; or via a 
parenteral route, e.g., by an intravenous, subcutaneous, 
intraperitoneal, intradermal, intraepidermal, or intramuscular 
route. The choice of administration route depends on the 
formulation that is selected. A polynucleotide formulated in 
15 association with bupivacaine is advantageously administered into 
muscles. When a neutral or anionic liposome or a cationic 
lipid, such as DOTMA or DC-Chol, is used, the formulation can be 
advantageously injected via intravenous, intranasal 
(aerosolization) , intramuscular, intradermal, and subcutaneous 
20 routes. A polynucleotide in a naked form can advantageously be 
administered via the intramuscular, intradermal, or sub- 
cutaneous routes . 

Although not absolutely required, such a composition can 
also contain an adjuvant. If so, a systemic adjuvant that does 
25 not require concomitant administration in order to exhibit an 
adjuvant effect is preferable such as, e.g., QS21, which is 
described in U.S. Patent No. 5,057,546. 

The sequence information provided in the present 
application enables the design of specific nucleotide probes and 
30 primers that are used for diagnostic purposes. Accordingly, a 
fifth aspect of the invention provides a nucleotide probe or 
primer having a sequence found in or derived by degeneracy of 
the genetic code from a sequence shown in any one of SEQ ID Nos: 
1 to 10. 
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The term "probe" as used in the present application 
refers to DNA (preferably single stranded) or RNA molecules (or 
modifications or combinations thereof) that hybridize under the 
stringent conditions, as defined above, to nucleic acid 
5 molecules having SEQ ID Nos: 1 to 10 or to sequences homologous 
to SEQ ID Nos: 1 to 10, or to their complementary or anti-sense 
sequences. Generally, probes are significantly shorter than 
full-length sequences . Such probes contain from about 5 to 
about 100, preferably from about 10 to about 80, nucleotides . 
10 In particular, probes have sequences that are at least 75%, 
preferably at least 85%, more preferably 95% homologous to a 
portion of any of SEQ ID Nos: 1 to 10 or that are complementary 
to such sequences. Probes may contain modified bases such as 
inosine, methyl-5-deoxycytidine, deoxyuridine, dimethylamino-5- 
15 deoxyuridine, or diamino-2, 6-purine. Sugar or phosphate 

residues may also be modified or substituted. For example, a 
deoxyribose residue may be replaced by a polyamide (Nielsen et 
al.. Science (1991) 254:1497) and phosphate residues may be 
replaced by ester groups such as diphosphate, alkyl, . 
20 arylphosphonate and phosphorothioate esters. In addition, the 
2'-hydroxyl group on ribonucleotides may be modified by 
including such groups as alkyl groups. 

Probes of the invention are used in diagnostic tests, as 
capture or detection probes. Such capture probes are 
25 conventionally immobilized on a solid support, directly or 
indirectly, by covalent means or by passive adsorption. A 
detection probe is labelled by a detection marker selected from: 
radioactive isotopes, enzymes such as peroxidase, alkaline 
phosphatase, and enzymes able to hydrolyze a chromogenic, 
30 fluorogenic, or luminescent substrate, compounds that are 
chromogenic, fluorogenic, or liominescent, nucleotide base 
analogs, and biotin. 

Probes of the invention are used in any conventional 
hybridization technique, such as dot blot (Maniatis et al.. 
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Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York), Southern blot 
(Southern, J- Mol. Biol. (1975) 98:503), northern blot 
(identical to Southern blot with the exception that RNA is used 
5 as a target), or the sandwich technique (Dunn et al.. Cell 
(1977) 12:23). The latter technique involves the use of a 
specific capture probe, and/or a specific detection probe with 
nucleotide sequences that at least partially differ from each 
other. 

10 A primer is a probe of usually about 10 to about 

40 nucleotides that is used to initiate enzymatic polymerization 
of DNA in an amplification process (e.g., PCR) , in an elongation 
process, or in a reverse transcription method. Primers used in 
diagnostic methods involving PCR are labeled by methods known in 

15 the art. 

As described herein, the invention also encompasses (i) a 
reagent comprising a probe of the invention for detecting and/or 
identifying the presence of Chlamydia in a biological material; 
(ii) a method for detecting and/or identifying the presence of 
20 Chlamydia in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA or 
RNA is extracted from the material and denatured, and (c) 
exposed to a probe of the invention, for example, a capture, 
detection probe or both, under stringent hybridization 
25 conditions, such that hybridization is detected; and (iii) a 
method for detecting and/or identifying the presence of 
Chlamydia in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA is 
extracted therefrom, (c) the extracted DNA is primed with at 
30 least one, and preferably two, primers of the invention and 
amplified by polymerase chain reaction, and (d) the amplified 
DNA fragment is produced. 

It is apparent that disclosure of polynucleotide 
sequences of SEQ ID Nos: 1 to 10, their homolog, and partial 
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sequences of either enable their corresponding amino acid 
sequences. Accordingly, a sixth aspect of the invention 
features a substantially purified polypeptide or polypeptide 
derivative having an amino acid sequence encoded by a 
5 polynucleotide of the invention. 

A "substantially purified polypeptide" as used herein is 
defined as a polypeptide that is separated from the environment 
in which it naturally occurs and/or that is free of the majority 
of the polypeptides that are present in the environment in which 

10 it was synthesized. For example, a substantially purified 
polypeptide is free from cytoplasmic polypeptides. Those 
skilled in the art would readily understand that the 
polypeptides of the invention may be purified from a natural 
source, i.e., a Chlamydia strain, or produced by recombinant 

15 means. 

Consistent with the sixth aspect of the invention are 
polypeptides, homologs or fragments which are modified or 
treated to enhance their immunogenicity in the target animal, in 
whom the polypeptide, homolog or fragments are intended to 

20 confer protection against Chlamydia. Such modifications or 

treatments include: amino acid substitutions with an amino acid 
derivative such as 3-methyhistidine, 4-hydroxyproline, 5- 
hydroxylysine etc., modifications or deletions which are carried 
out after preparation of the polypeptide, homolog or fragment, 

25 such as the modification of free amino, carboxyl or hydroxyl 
side groups of the amino acids. 

Identification of homologous polypeptides or polypeptide 
derivatives encoded by polynucleotides of the invention which 
have specific antigenicity is achieved by screening for cross- 

30 reactivity with an antiserum raised against the polypeptide of 
reference having an amino acid sequence of any one of SEQ ID 
Nos: 11 to 16. The procedure is as follows: a monospecific 
hyperimmune antiserum is raised against a purified reference 
polypeptide, a fusion polypeptide (for example, an expression 
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product of MBP, GST, or His-tag systems), or a synthetic peptide 
predicted to be antigenic. Where an antiserum is raised against 
a fusion polypeptide, two different fusion systems are employed. 
Specific antigenicity can be determined according to a number of 
5 methods, including Western blot (Towbin et ai», Proc. Natl. 
Acad. Sci. USA (1979) 76:4350), dot blot, and ELISA, as 
described below. 

In a Western blot assay, the product to be screened, 
either as a purified preparation or a total E. coli extract, is 

10 submitted to SDS-Page electrophoresis as described by Laemmli 
(Nature (1970) 227:680). After transfer to a nitrocellulose 
membrane, the material is further incubated with the 
monospecific hyperimmune antiserum diluted in the range of 
dilutions from about 1:5 to about 1:5000, preferably from about 

15 1:100 to about 1:500. Specific antigenicity is shown once a 
band corresponding to the product exhibits reactivity at any of 
the dilutions in the above range. 

In an ELISA assay, the product to be screened is 
preferably used as the coating antigen. A purified preparation 

20 is preferred, although a whole cell extract can also be used. 
Briefly, about 100 ]xl of a preparation at about 10 protein/ml 
are distributed into wells of a 96-well polycarbonate ELISA 
plate. The plate is incubated for 2 hours at 37 °C then 
overnight at 4°C. The plate is washed with phosphate buffer 

25 saline (PBS) containing 0.05% Tween 20 (PBS/Tween buffer). The 
wells are saturated with 250 ]il PBS containing 1% bovine serum 
albumin (BSA) to prevent non-specific antibody binding. After 1 
hour incubation at 37 °C, the plate is washed with PBS/Tween 
buffer. The antiserum is serially diluted in PBS/Tween buffer 

30 containing 0.5% BSA. 100 pi of dilutions are added per well. 
The plate is incubated for 90 minutes at 37 °C, washed and 
evaluated according to standard procedures- For example, a goat 
anti-rabbit peroxidase conjugate is added to the wells when 
specific antibodies were raised in rabbits. Incubation is 
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carried out for 90 minutes at 37*^0 and the plate is washed. The 
reaction is developed with the appropriate substrate and the 
reaction is measured by colorimetry (absorbance measured 
spectrophotometrically) . Under the above experimental 
5 conditions, a positive reaction is shown by O.D. values greater 
than a non immune control serum. 

In a dot blot assay, a purified product is preferred, 
although a whole cell extract can also be used. Briefly, a 
solution of the product at about 100 pg/ml is serially two-fold 
10 diluted in 50 mM Tris-HCl (pH 7.5). 100 pi of each dilution are 
applied to a nitrocellulose membrane 0.45 pm set in a 96-well 
dot blot apparatus (Biorad) . The buffer is removed by applying 
vacuum to the system. Wells are washed by addition of 50 mM 
Tris-HCl (pH 7.5) and the membrane is air-dried. The membrane 
15 is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5) 0.15 M 
NaCl, 10 g/L skim milk) and incubated with an antiserum dilution 
from about 1:50 to about 1:5000, preferably about 1:500. The 
reaction is revealed according to standard procedures. For 
example, a goat anti-rabbit peroxidase conjugate is added to the 
20 wells when rabbit antibodies are used. Incubation is carried 
out 90 minutes at 37**C and the blot is washed. The reaction is 
developed with the appropriate substrate and stopped. The 
reaction is measured visually by the appearance of a colored 
spot, e.g., by colorimetry. Under the above experimental 
25 conditions, a positive reaction is shown once a colored spot is 
associated with a dilution of at least about 1:5, preferably of 
at least about 1:500. 

Therapeutic or prophylactic efficacy of a polypeptide or 
derivative of the invention can be evaluated as described below. 
30 A seventh aspect of the invention provides (i) a composition of 
matter comprising a polypeptide of the invention together with a 
diluent or carrier; specifically (ii) a pharmaceutical 
coitqposition containing a therapeutically or prophylactically 
effective amount of a polypeptide of the invention; (iii) a 



wo 00/32794 PCT/CA99/01 147 

40 

method for inducing an immune response against Chlamydia in a 
mammal, by administering to the mammal an immunogenically 
effective amount of a polypeptide of the invention to elicit a 
protective immune response to Chlamydia; and particularly, (iv) 
5 a method for preventing and/or treating a Chlamydia (e.g., 
C. trachomatis, C. psittaci, C, pneumoniae, or C. pecorum) 
infection, by administering a prophylactic or therapeutic amount 
of a polypeptide of the invention to an infected individual. 
Additionally, the seventh aspect of the invention encompasses 

10 the use of a polypeptide of the invention in the preparation of 
a medicament for preventing and/or treating Chlamydia infection. 

As used herein, the immunogenic compositions of the 
invention are administered by conventional routes known the 
vaccine field, in particular to a mucosal (e.g., ocular, 

15 intranasal, pulmonary, oral, gastric, intestinal, rectal, 

vaginal, or urinary tract) surface or via the parenteral (e.g., 
subcutaneous, intradermal, intramuscular, intravenous, or 
intraperitoneal) route. The choice of administration route 
depends upon a number of parameters, such as the adjuvant 

20 associated with the polypeptide. If a mucosal adjuvant is used, 
the intranasal or oral route is preferred. If a lipid 
formulation or an aluminum compound is used, the parenteral 
route is preferred with the sub-cutaneous or intramuscular route 
being most preferred. The choice also depends upon the nature 

25 of the vaccine agent. For example, a polypeptide of the 

invention fused to CTB or LTB is best administered to a mucosal 
surface. 

As used herein, the composition of the invention contains 
one or several polypeptides or derivatives of the invention. 
30 The composition optionally contains at least one additional 
Chlamydia antigen, or a subunit, fragment, homolog, mutant, or 
derivative thereof. 

For use in a composition of the invention, a polypeptide 
or derivative thereof is formulated into or with liposomes. 
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preferably neutral or anionic liposomes, microspheres, ISCOMS, 
or virus-like-particles (VLPs) to facilitate delivery and/or 
enhance the iiranune response. These compounds are readily 
available to one skilled in the art; for example, see Liposomes: 
5 A Practical Approach, RFC New Ed, IRL press (1990) . 

Adjuvants other than liposomes and the like are also used 
and are known in the art. Adjuvants may protect the antigen 
from rapid dispersal by sequestering it in a local deposit, or 
they may contain substances that stimulate the host to secrete 
10 factors that are chemotactic for macrophages and other 

components of the immune system. An appropriate selection can 
conventionally be made by those skilled in the art, for example, 
from those described below (see the eleventh aspect of the 
invention) . 

15 Treatment is achieved in a single dose or repeated as 

necessary at intervals, as can be determined readily by one 
skilled in the art. For example, a priming dose is followed by 
three booster doses at weekly or monthly intervals. An 
appropriate doise depends on various parameters including the 

20 recipient (e.g., adult or infant), the particular vaccine 
antigen, the route and frequency of administration, the 
presence/absence or type of adjuvant, and the desired effect 
(e.g., protection and/or treatment), as can be determined by one 
skilled in the art. In general, a vaccine antigen of the 

25 invention is administered by a mucosal route in an amount from 
about 10 pg to about 500 mg, preferably from about 1 mg to about 
200 mg. For the parenteral route of administration, the dose 
usually does not exceed about 1 mg, preferably about 100 ]ig. 
When used as vaccine agents, polynucleotides and 

30 polypeptides of the invention may be used sequentially as part 
of a multistep immunization process. For example, a mammal is 
initially primed with a vaccine vector of the invention such as 
a pox virus, e.g., via the parenteral route, and then boosted 
twice with the polypeptide encoded by the vaccine vector, e.g.. 
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via the mucosal route. In another example, liposomes associated 
with a polypeptide or derivative of the invention is also used 
for priming, with boosting being carried out mucosally using a 
soluble polypeptide or derivative of the invention in 
5 combination with a mucosal adjuvant (e.g., LT) . 

A polypeptide derivative of the invention is also used in 
accordance with the seventh aspect as a diagnostic reagent for 
detecting the presence of anti-Chlamydia antibodies, e.g., in a 
blood sample. Such polypeptides are about 5 to about 80, 
10 preferably about 10 to about 50 amino acids in length. They are 
either labeled or unlabeled, depending upon the diagnostic 
method. Diagnostic methods involving such a reagent are 
described below. 

Upon expression of a DNA molecule of the invention, a 
15 polypeptide or polypeptide derivative is produced and purified 
using known laboratory techniques. As described above, the 
polypeptide or polypeptide derivative may be produced as a 
fusion protein containing a fused tail that facilitates 
purification. The fusion product is used to immunize a small 
20 mammal, e.g., a mouse or a rabbit, in order to raise antibodies 
against the polypeptide or polypeptide derivative (monospecific 
antibodies) • Accordingly, an eighth aspect of the invention 
provides a monospecific antibody that binds to a polypeptide or 
polypeptide derivative of the invention. 
25 By "monospecific antibody" is meant an antibody that is 

capable of reacting with a unique naturally-occurring Chlamydia 
polypeptide. An antibody of the invention is either polyclonal 
or monoclonal. Monospecific antibodies may be recombinant, 
e.g., chimeric (e.g., constituted by a variable region of murine 
30 origin associated with a human constant region), humanized (a 
human immunoglobulin constant backbone together with 
hypervariable region of animal, e.g., murine, origin), and/or 
single chain. Both polyclonal and monospecific antibodies may 
also be in the form of immunoglobulin fragments, e.g., F{ab)'2 
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or Fab fragments. The antibodies of the invention are of any 
isotype, e.g., IgG or IgA, and polyclonal antibodies are of a 
single isotype or a mixture of isotypes. 

Antibodies against the polypeptides, homologs or 
5 fragments of the present invention are generated by immunization 
of a mammal with a composition comprising said polypeptide, 
homolog or fragment. Scu antibodies may be polyclonal or 
monoclonal. Methods to produce polyclonal or monoclonal 
antibodies are well known in the art. For a review, see 

10 ^'Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, 
Eds. E. Harlow and D. Lane (1988), and D.E. Yelton et al., 1981. 
Ann. Rev. Biochem. 50:657-680.. For monoclonal antibodies, see 
Kohler and Milstein (1975) Nature. 256:495-497. 

The antibodies of the invention, which are raised to a 

15 polypeptide or polypeptide derivative of the invention, are 
produced and identified using standard immunological assays, 
e.g.. Western blot analysis, dot blot assay, or ELISA (see, 
e.g., Coligan et ai.. Current Protocols in Immunology (1994) 
John Wiley & Sons, Inc., New York, NY). The antibodies are used 

20 in diagnostic methods to detect the presence of a Chlamydia 
antigen in a sample, such as a biological sample. The 
antibodies are also used in affinity chromatography for 
purifying a polypeptide or polypeptide derivative of the 
invention. As is discussed further below, such antibodies may 

25 be used in prophylactic and therapeutic passive immunization 
methods . 

Accordingly, a ninth aspect of the invention provides 
(i) a reagent for detecting the presence of Chlamydia in a 
biological sample that contains an antibody, polypeptide, or 
30 polypeptide derivative of the invention; and (ii) a diagnostic 
method for detecting the presence of Chlamydia in a biological 
sample, by contacting the biological sample with an antibody, a 
polypeptide, or a polypeptide derivative of the invention, such 
that an immune complex is formed, and by detecting such complex 
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to indicate the presence of Chlamydia in the sample or the 
organism from which the sample is derived. 

Those skilled in the art will readily understand that the 
immune complex is formed between a component of the sample and 
5 the antibody, polypeptide, or polypeptide derivative, whichever 
is used, and that any unbound material is removed prior to 
detecting the complex. It is understood that a polypeptide 
reagent is useful for detecting the presence of anti-Cijlamydia 
antibodies in a sample, e.g., a blood sample, while an antibody 

10 of the invention is used for screening a sample, such as a 
gastric extract or biopsy, for the presence of Chlamydia 
polypeptides. 

For diagnostic applications, the reagent (i.e., the 
antibody, polypeptide, or polypeptide derivative of the 

15 invention) is either in a free state or immobilized on a solid 
support, such as a tube, a bead, or any other conventional 
support used in the field. Immobilization is achieved using 
direct or indirect means. Direct means include passive 
adsorption (non-covalent binding) or covalent binding between 

20 the support and the reagent. By "indirect means" is meant that 
an anti-reagent compound that interacts with a reagent is first 
attached to the solid support. For example, if a polypeptide 
reagent is used, an antibody that binds to it can serve as an 
anti-reagent, provided that it binds to an epitope that is not 

25 involved in the recognition of antibodies in biological samples. 
Indirect means may also employ a ligand- receptor system, for 
example, where a molecule such as a vitamin is grafted onto the 
polypeptide reagent and the corresponding receptor immobilized 
on the solid phase. This is illustrated by the biotin- 

30 streptavidin system. Alternatively, a peptide tail is added 
chemically or by genetic engineering to the reagent and the 
grafted or fused product immobilized by passive adsorption or 
covalent linkage of the peptide tail. 
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Such diagnostic agents may be included in a kit which 
also comprises instructions for use* The reagent are labeled 
with a detection means which allows for the detection of the 
reagent when it is bound to its target* The detection means may 
5 be a fluorescent agent such as fluorescein isocyanate or 

fluorescein isothiocyanate, or an enzyme such as horse radish 
peroxidase or luciferase or alkaline phosphatase, or a 
radioactive element such as ^^^I or ^^Cr. 

Accordingly, a tenth aspect of the invention provides a 
10 process for purifying, from a biological sample, a polypeptide 
or polypeptide derivative of the invention, which involves 
carrying out antibody-based affinity chromatography with the 
biological sample, wherein the antibody is a monospecific 
antibody of the invention. 
15 For use in a purification process of the invention, the 

antibody is either polyclonal or monospecific, and preferably is 
of the IgG type. Purified IgGs is prepared from an antiserum 
using standard methods (see, e.g., Coligan at al.. Current 
Protocols in Immunology (1994) John Wiley & Sons, Inc., New 
20 York, NY) . Conventional chromatography supports, as well as 
standard methods for grafting antibodies, are described in, 
e.g., Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds. 
(1988) and outlined below. 

Briefly, a biological sample, such as an C. pneumoniae 
25 extract preferably in a buffer solution, is applied to a 

chromatography material, preferably equilibrated with the buffer 
used to dilute the biological sample so that the polypeptide or 
polypeptide derivative of the invention (i.e., the antigen) is 
allowed to adsorb onto the material. The chromatography 
30 material, such as a gel or a resin coupled to an antibody of the 
invention, is in either a batch form or a column. The unbound 
components are washed off and the antigen is then eluted with an 
appropriate elution buffer, such as a glycine buffer or a buffer 
containing a chaotropic agent, e.g., guanidine HCl, or high salt 
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concentration {e.g., 3 M MgCl2) . Eluted fractions are recovered 
and the presence of the antigen is detected, e.g., by measuring 
the absorbance at 280 nm. 

An eleventh aspect of the invention provides (i) a 
5 composition of matter comprising a monospecific antibody of the 
invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition comprising a therapeutically or 
prophylactically effective amount of a monospecific antibody of 
the invention, and (iii) a method for treating or preventing a 
10 Chlamydia (e.g., C. trachomatis^ C. psittaci, C. pneumoniae or 
C. pecorvm) infection, by administering a therapeutic or 
prophylactic amount of a monospecific antibody of the invention 
to an infected individual. Additionally, the eleventh aspect of 
the invention encompasses the use of a monospecific antibody of 
15 the invention in the preparation of a medicament for treating or 
preventing Chlamydia infection. 

The monospecific antibody is either polyclonal or 
monoclonal, preferably of the IgA isotype (predominantly) . In 
passive immunization, the antibody is administered to a mucosal 
20 surface of a mammal, e.g., the gastric mucosa, e.g., orally or 
intragastrically, advantageously, in the presence of a 
bicarbonate buffer. Alternatively, systemic administration, not 
requiring a bicarbonate buffer, is carried out. A monospecific 
antibody of the invention is administered as a single active 
25 component or as a mixture with at least one monospecific 

antibody specific for a different Chlamydia polypeptide. The 
amount of antibody and the particular regimen used are readily 
determined by one skilled in the art. For example, daily 
administration of about 100 to 1,000 mg of antibodies over one 
30 week, or three doses per day of about 100 to 1,000 mg of 

antibodies over two or three days, are effective regimens for 
most purposes. 

Therapeutic or prophylactic efficacy are evaluated using 
standard methods in the art, e.g., by measuring induction of a 
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mucosal immune response or induction of protective and/or 
therapeutic immunity, using, e.g., the C. pneumoniae mouse 
model. Those skilled in the art will readily recognize that the 
C. pneumoniae strain of the model may be replaced with another 
5 Chlamydia strain. For example, the efficacy of DNA molecules 
and polypeptides from C. pneumoniae is preferably evaluated in a 
mouse model using C. pneumoniae strain. Protection is 
determined by comparing the degree of Chlamydia infection to 
that of a control group. Protection is shown when infection is 

10 reduced by comparison to the control group. Such an evaluation 
is made for polynucleotides, vaccine vectors, polypeptides and 
derivatives thereof, as well as antibodies of the invention. 

Adjuvants useful in any of the vaccine compositions 
described above are as follows. 

15 Adjuvants for parenteral administration include aluminum 

compounds, such as aluminum hydroxide, aluminum phosphate, and 
aluminum hydroxy phosphate. The antigen is precipitated with, 
or adsorbed onto, the aluminum compound according to standard 
protocols. Other adjuvants, such as RIBI (ImmunoChem, Hamilton, 

20 MT) , is used in parenteral administration. 

Adjuvants for mucosal administration include bacterial 
toxins, e.g., the cholera toxin (CT) , the E. coli heat-labile 
toxin (LT) , the Clostridium difficile toxin A and the pertussis 
toxin (PT), or combinations, subunits, toxoids, or mutants 

25 thereof such as a purified preparation of native cholera toxin 
subunit B (CTB) . Fragments, homologs, derivatives, and fusions 
to any of these toxins are also suitable, provided that they 
retain adjuvant activity. Preferably, a mutant having reduced 
toxicity is used. Suitable mutants are described, e.g., in WO 

30 95/17211 (Arg-7-Lys CT mutant), WO 96/06627 (Arg-192-Gly LT 
mutant) , and WO 95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant) - 
Additional LT mutants that are used in the methods and 
compositions of the invention include, e.g., Ser-63-Lys, Ala-69- 
Gly, Glu-llO-Asp, and Glu-112-Asp mutants. Other adjuvants. 
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such as a bacterial monophosphoryl lipid A (MPLA) of, e.g.f JEJ- 
coli. Salmonella minnesotar Salmonella typhimuriumf or Shigella 
flexneri; saponins, or polylactide glycolide (PLGA) 
microspheres, is also be used in mucosal administration. 
5 Adjuvants useful for both mucosal and parenteral 

administrations include polyphosphazene (WO 95/02415), DC-chol 
(3 b- (N- (N' ,N' -dimethyl aminomethane) -carbamoyl) cholesterol; 
U.S. Patent No. 5,283,185 and WO 96/14831) and QS-21 (WO 
88/09336) . 

10 Any pharmaceutical composition of the invention 

containing a polynucleotide, a polypeptide, a polypeptide 
derivative, or an antibody of the invention, is manufactured in 
a conventional manner. In particular, it is formulated with a 
pharmaceutically acceptable diluent or carrier, e.g., water or a 

15 saline solution such as phosphate buffer saline. In general, a 
diluent or carrier is selected on the basis of the mode and 
route of administration, and standard pharmaceutical practice. 
Suitable pharmaceutical carriers or diluents, as well as 
pharmaceutical necessities for their use in pharmaceutical 

20 formulations, are described in Remington's Pharmaceutical 
Sciences, a standard reference text in this field and in the 
USP/NF. 

The invention also includes methods in which Chlamydia 
infection are treated by oral administration of a Chlamydia 

25 polypeptide of the invention and a mucosal adjuvant, in 

combination with an antibiotic, an antacid, sucralfate, or a 
combination thereof. Examples of such compounds that can be 
administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, and 

30 derivatives thereof (specific examples of antibiotics that can 
be used include azithromycin or doxicyclin or immunomodulators 
such as cytokines or steroids) . In addition, compounds 
containing more than one of the above-listed components coupled 
together, are used. The invention also includes compositions 



wo 00/32794 PCT/CA99/0U47 

49 

for carrying out these methods, i.e., compositions containing a 
Chlamydia antigen (or antigens) of the invention, an adjuvant, 
and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 
5 Amounts of the above-listed compounds used in the methods 

and compositions of the invention are readily determined by one 
skilled in the art. Treatment /immunization schedules are also 
known and readily designed by one skilled in the art. For 
example, the non- vaccine components can be administered on days 
10 1-14, and the vaccine antigen + adjuvant can be administered on 
days 7, 14, 21, and 28. 
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CLAIMS 

5 1. A nucleic acid molecule comprising a nucleic acid 

sequence which encodes a polypeptide selected from any of: 

(a) SEQ ID Nos: 11 to 16; 

(b) an immunogenic fragment comprising at least 12 
consecutive amino acids from a polypeptide of (a); and 

10 (c) a polypeptide of (a) or (b) which has been modified to 

improve its immunogenicity, wherein said modified 
polypeptide is at least 75% identical in amino acid 
sequence to the corresponding polypeptide of (a) or 
(b). 

15 

2. A nucleic acid molecule comprising a nucleic acid 

sequence selected from any of: 

(a) SEQ ID Nos: 1 to 10; 

(b) a sequence which encodes a polypeptide encoded by any 
20 one of SEQ ID Nos: 1 to 10; 

(c) a sequence comprising at least 38 consecutive 
nucleotides from any one of the nucleic acid sequences 
of (a) and (b) ; and 

(d) a sequence which encodes a polypeptide which is at 

25 least 75% identical in amino acid sequence to any one 

of the polypeptides encoded by SEQ ID Nos: 1 to 10. 
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3. A nucleic acid molecule comprising a nucleic acid 

sequence which is anti-sense to the nucleic acid molecule of 
claim 1 or 2. 

5 4. A nucleic acid molecule comprising a nucleic acid 

sequence which encodes a fusion protein, said fusion protein 
comprising a polypeptide encoded by a nucleic acid molecule 
according to claim 1 and an additional polypeptide. 

10 5. The nucleic acid molecule of claim 4 wherein the 

additional polypeptide is a heterologous signal peptide. 

6. The nucleic acid molecule of claim 4 wherein the 
additional polypeptide has adjuvant activity. 

7. A nucleic acid molecule according to any one of claims 
1 to 6, operatively linked to one or more expression control 
sequences . 

20 8. A vaccine comprising at least one first nucleic acid 

according to any one of claims 1, 2, and 4 to 7 and a vaccine 
vector wherein each first nucleic acid is expressed as a 
polypeptide, the vaccine optionally comprising a second nucleic 
acid encoding an additional polypeptide which enhances the 

25 immune response to the polypeptide expressed by said first 
nucleic acid. 
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9. The vaccine of claim 8 wherein the second nucleic acid 

encodes an additional Chlamydia polypeptide. 

5 10. A pharmaceutical composition comprising a nucleic acid 

according to any one of claims 1 to 7 and a pharmaceutically 
acceptable carrier. 

11. A pharmaceutical composition comprising a vaccine 
10 according to claim 8 or 9 and a pharmaceutically acceptable 

carrier. 

12. A unicellular host transformed with the nucleic acid 
molecule of claim 7 . 

15 

13. A nucleic acid probe of 5 to 100 nucleotides which 
hybridizes under stringent conditions to any one of nucleic acid 
molecules of SEQ ID Nos: 1 to 10, or to a homolog or 
complementary or anti-sense sequence of said nucleic acid 

20 molecule. 

14. A primer of 10 to 40 nucleotides which hybridizes 
under stringent conditions to any one of nucleic acid molecules 
of SEQ ID Nos: 1 to 10, or to a homolog or complementary or 

25 anti-sense sequence of said nucleic acid molecule. 



wo 00/32794 PCT/CA99/01 147 

53 

15. A polypeptide encoded by a nucleic acid sequence 
according to any one of claims 1, 2 and 4 to 7. 

A polypeptide comprising an amino acid sequence 
from any of: 
SEQ ID Nos: 11 to 16; 

an immunogenic fragment comprising at least 12 
consecutive amino acids from a polypeptide of (a) ; and 
a polypeptide of (a) or (b) which has been modified to 
improve its immunogenicity, wherein said modified 
polypeptide is at least 75% identical in amino acid 
sequence to the corresponding polypeptide of (a) or 
(b) . 

15 17. A fusion polypeptide comprising a polypeptide of claim 

15 or 16 and an additional polypeptide. 

18. The fusion polypeptide of claim 17 wherein the 
additional polypeptide is a heterologous signal peptide. 

20 

19. The fusion protein of claim 17 wherein the additional 
polypeptide has adjuvant activity, 

20. A method for producing a polypeptide of claim 15 or 
25 16, comprising the step of culturing a unicellular host 

according to claim 12 . 



16. 
5 selected 
(a) 
(b) 

(c) 

10 
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21. An antibody against the polypeptide of any one of 

claims 15 to 19. 

5 22. A vaccine comprising at least one first polypeptide 

according to any one of claims 15 to 19 and a pharmaceutically 
acceptable carrier, optionally comprising a second polypeptide 
which enhances the immune response to the first polypeptide. 

10 23. The vaccine of claim 22 wherein the second polypeptide 

comprises an additional Chlamydia polypeptide. 

24. A pharmaceutical composition comprising a polypeptide 
according to any one of claims 15 to 19 and a pharmaceutically 

15 acceptable carrier. 

25. A pharmaceutical composition comprising a vaccine 
according to claim 22 or 23 and a pharmaceutically acceptable 
carrier. 

20 

26. A pharmaceutical composition comprising an antibody 
according to claim 21 and a pharmaceutically acceptable carrier. 



27 . A method for preventing or treating Chlamydia 

25 infection using: 

(a) the nucleic acid of any one of claims 1 to 7; 
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(b) the vaccine of any one of claims 8, 9, 22 and 23; 

(c) the pharmaceutical composition of any one of claims 
10, 11, 24 to 26; 

(d) the polypeptide of any one of claims 15 to 19; or 
5 (e) the antibody of claim 21. 

28. A method of detecting Chlamydia infection comprising 
the step of assaying a body fluid of a mammal to be tested, with 
a component selected from any one of: 

10 (a) the nucleic acid of any one of claims 1 to 7; 

(b) the polypeptide of any one of claims 15 to 19; and 

(c) the antibody of claim 21- 

29. A diagnostic kit comprising instructions for use and a 
15 component selected from any one of: 

(a) the nucleic acid of any one of claims 1 to 7; 

(b) the polypeptide of any one of claims 15 to 19; and 

(c) the antibody of claim 21. 

20 30, A method for identifying a polypeptide of claims 15 to 

19 which induces an immune response effective to prevent or 
lessen the severity of Chlamydia infection in a mammal 
previously immunized with polypeptide, comprising the steps of: 
(a) immunizing a mouse with the polypeptide; and 

25 (b) inoculating the immunized mouse with Chlamydia; 
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wherein the polypeptide which prevents or lessens the severity 
of Chlamydia infection in the immunized mouse compared to a non- 
immunized control mouse is identified. 
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Figure 1 : Nucleotide and deduced amino acid sequence of CPN 100634 

gataaaaatt cttgacagct gttttgtcat ctttaacttg atttacttat tttgtttcta 60 

tattgatgcg aatagttctc taaaaaacaa aagcattacc atg aag act teg att 115 

Met Lys Thr Ser lie 
1 5 

cct tgg gtt tta gtt tec tec gtg tta get tte tea tgt cac eta cag 163 
Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin 
10 15 20 

tea eta get aac gag gaa ctt tta tea cct gat gat age ttt aat gga 211 
Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly 
25 30 35 

aat ate gat tea gga acg ttt act eca aaa act tea gee aca aca tat 259 
Asn He Asp Ser Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr 
40 45 50 

tot eta aca gga gat gtc tte ttt tac gag cct gga aaa ggc act ccc 307 
Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro 
55 60 65 

tta tct gae agt tgt ttt aag caa ace acg gae aat ctt ace tte ttg 355 
Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu 
70 75 80 85 

ggg aac ggt eat age tta acg ttt ggc ttt ata gat get ggc act cat 403 
Gly Asn Gly His Ser Leu Thr Phe Gly Phe He Asp Ala Gly Thr His 
90 95 100 

gca ggt get get gea tct aca aca gea aat aag aat ctt acc tte tea 451 
Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser 
105 110 115 

ggg ttt tec tta ctg agt ttt gat tec tct cct age aca acg gtt act 499 
Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr 
120 125 130 

aca ggt cag gga acg ctt tec tea gca gga ggc gta aat tta gaa aat 547 
Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn 
135 140 145 

att cgt aaa ctt gta gtt get ggg aat ttt tct act gca gat ggt gga 595 
He Arg Lys Leu Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly 
150 155 160 165 

get ate aaa gga gcg tct tte ctt tta act ggc act tct gga gat get 643 
Ala He Lys Gly Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala 
170 175 ISO 

Ctt ttt agt aac aac tct tea tea aca aag gga gga gca att get act 691 
Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly Gly Ala He Ala Thr 
185 190 195 
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Fig. 1 (con't) 

aca gca ggc get cgc ata gca aat aac aca ggt tat gtt aga ttc eta 739 
Thr Ala Gly Ala Arg lie Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu 
200 205 210 

tct aac ata gcg tct acg tea gga ggc get ate gat gat gaa ggc acg 787 
Ser Asn lie Ala Ser Thr Ser Gly Gly Ala lie Asp Asp Glu Gly Thr 
215 220 225 

teg ata eta teg aac aac aaa ttt eta tat ttt gaa ggg aat gca gcg 835 
Ser lie Leu Ser Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala 
230 235 240 245 

aaa act act ggc ggt gcg ate tge aac ace aag gcg agt gga tct eet 883 
Lys Thr Thr Gly Gly Ala He Cys Asn Thr Lys Ala Ser Gly Ser Pro 
250 255 260 

gaa ctg ata ate tct aac aat aag act ctg ate ttt get tea aac gta 931 
Glu Leu He lie Ser Asn Asn Lys Thr Leu He Phe Ala Ser Asn Val 
265 270 275 

gca gaa aca age ggt ggc gee ate cat get aaa aag eta gee ett tec 979 
Ala Glu Thr Ser Gly Gly Ala He His Ala Lys Lys Leu Ala Leu Ser 
280 285 290 

tct gga ggc ttt aca gag ttt eta ega aat aat gte tea tea gca act 1027 
Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr 
295 300 305 

ect aag ggg ggt get ate age ate gat gee tea gga gag etc agt ctt 1075 
Pro Lys Gly Gly Ala He Ser He Asp Ala Ser Gly Glu Leu Ser Leu 
310 315 320 325 

tct gca gag aca gga aac att ace ttt gta aga aat acc ctt aca aca 1123 
Ser Ala Glu Thr Gly Asn He Thr Phe Val Arg Asn Thr Leu Thr Thr 
330 335 340 

acc gga agt acc gat act ect aaa cgt aat gcg ate aac ata gga agt 1171 
Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala He Asn He Gly Ser 
345 350 355 

aac ggg aaa ttc acg gaa tta egg get get aaa aat cat aca att ttc 1219 
Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys Asn His Thr He Phe 
360 365 370 

ttc tat gat ccc ate act tea gaa gga acc tea tea gac gta ttg aag 1267 
Phe Tyr Asp Pro He Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys 
375 380 385 

ata aat aac ggc tct gcg gga get etc aat cca tat caa gga acg att 1315 
He Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro. Tyr Gin Gly Thr He 
390 395 400 405 

eta ttt tct gga gaa acc eta aca gca gat gaa ctt aaa gtt arc gac 1363 
Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp 
410 415 420 
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aat tta aaa tct tea ttc acg cag cca gtc tec eta tec gga gga aag 1411 

Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser Leu Ser Gly Gly Lys 
425 430 435 

tta ttg eta caa aag gga gtc act tta gag age acg age ttc tct caa 1459 

Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gin 

440 445 450 

gag gee ggt tct etc etc gge atg gat tea gga acg aca tta tea act 1507 

Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr 
455 460 465 

aca get ggg agt att aca ate aeg aae eta gga ate aat get gac tec 1555 

Thr Ala Gly Ser He Thr He Thr Asn Leu Gly He Asn Val Asp Ser 
470 475 480 485 

tta ggt ett aag cag ccc gtc age eta aca gea aaa ggt get tea aat 1603 

Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn 
490 495 500 

aaa gtg ate gta tct ggg aag etc aac ctg att gat att gaa ggg aae 1651 

Lys Val He Val Ser Gly Lys Leu Asn Leu He Asp He Glu Gly Asn 
505 510 515 

att tat gaa agt eat atg ttc age eat gac cag etc ttc tct eta tta 1699 

He Tyr Glu Ser His Met Phe Ser His Asp Gin Leu Phe Ser Leu Leu 

520 525 530 

aaa ate aeg gtt gat get gat gtt gat act aac gtt gac ate age age 1747 

Lys He Thr Val Asp Ala Asp Val Asp Thr Asn Val Asp He Ser Ser 
535 540 545 

ett ate cct gtt cet get gag gat cet aat tea gaa tae gga ttc caa 1795 

Leu He Pro Val Pro Ala Glu Asp Pro Asn Ser Glu Tyr Gly Phe Gin 
550 555 560 565 

gga caa tgg aat gtt aat tgg act acg gat aca get aca aat aca aaa 1843 

Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys 
570 575 580 

gag gee acg gea act tgg aee aaa aca gga ttt gtt ccc age ccc gaa 1891 

Glu Ala Thr Ala Thr Trp Thr Lys -Thr Gly Phe Val Pro Ser Pro Glu 
585 590 595 

aga aaa tct geg tta gta tgc aat ace eta tgg gga gtc ttt act gac 1939 

Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp 

600 605 610 

att egc tct ctg caa eag ett gta gag ate gge gea act ggt atg gaa 1987 

He Arg Ser Leu Gin Gin Leu Val Glu He Gly Ala Thr Gly Met Glu 
615 620 625 

cac aaa caa ggt ttc tgg gtt tec tec atg acg aac ttc ctg cat aaq "3035 

His Lys Gin Gly Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lys 
630 635 640 645 
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act gga gat gaa aat cgc aaa ggc ttc cgt cat acc tct gga ggc tac 2083 
Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr 
650 655 660 

gtc ate ggt gga agt get cac act ect aaa gac gac eta ttt acc ttt 2131 
Val He Gly Gly Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe 
665 670 675 

gcg ttc tgc cat etc ttt get aga gac aaa gat tgt ttt ate get cae 2179 
Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp Cys Phe He Ala His 
680 685 690 

aac aac tct aga acc tae ggt gga act tta ttc ttc aag cae tct cat 2227 
Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His 
695 700 705 

acc eta caa ccc caa aac tat ttg aga tta gga aga gca aag ttt tct 2275 
Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser 
710 715 720 725 

gaa tea get ata gaa aaa tte cct agg gaa att ccc eta gee ttg gat 2323 
Glu Ser Ala He Glu Lys Phe Pro Arg Glu lie Pro Leu Ala Leu Asp 
730 735 740 

gtc caa gtt teg ttc age cat tea gac aac cgt atg gaa acg cac tat 2371 
Val Gin Val Ser Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr 
745 750 755 

acc tea ttg eca gaa tee gaa ggt tct tgg age aac gag tgt ata get 2419 
Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys He Ala 
760 765 770 

ggt ggt ate ggc eta gac ett cct ttt gtt ett tec aac eca eat cct .2467 
Gly Gly He Gly Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro 
775 780 785 

ett ttc aag acc tte att eca cag atg aaa gtc gaa atg gtt tat gta 2515 
Leu Phe Lys Thr Phe He Pro Gin Met Lys Val Glu Met Val Tyr Val 
790 795 800 805 

tea caa aat age tte tte gaa age tct agt gat ggc cgt ggt ttt agt 2563 
Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser 
810 815 820 

att gga agg etg ett aac etc teg att cct gtg ggt gcg aaa ttc gtg 2611 
He Gly Arg Leu Leu Asn Leu Ser He Pro Val Gly Ala Lys Phe Val 
825 830 835 

cag ggg gat ate gga gat tec cac acc tat gat etc tea gga ttc ttt 2659 
Gin Gly Asp He Gly Asp Ser Zyr Thr Tyr Asp Leu Ser Giy Phe Phe 
840 345 850 

gtt tec gat gtc Cat cgt aac aat ccc caa tct aca gcg act r+:t org ^.707 
Val Ser Asp Val Tyr Arg Asa Asn Pro Gin Ser Thr Ala Thr Leu Val 
855 860 865 
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Fig. 1 (con't) 

atg age cca gac tct tgg aaa att cgc ggt ggc aat ctt tea aga cag 2755 
Met Ser Pro Asp Ser Trp Lys lie Arg Gly Gly Asn Leu Ser Arg Gin 
870 ' 875 880 885 

gca ttt tta ctg agg ggt age aac aac tac gtc tac aac tec aat tgt 2803 
Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys 
890 895 900 

gag cte ttc gga cat tac get atg gaa etc cgt gga tct tea agg aac 2851 
Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn 
905 910 915 

tac aat gta gat gtt ggt ace aaa cte cga ttc tagattgcta aaaetcceta 2904 
Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
920 925 

gttcttetag ggagttttct catactttta gggaaatatt tgetat 2950 
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Figure 2: Restriction enzyme analysis of CPN 100634 

Alul 

Hpyl78III CviJI 
Apol iMspAlI 
Tsp509I I PvuII Msel SfaNI 

III I I 
GATAAAAATTCTTGACAGCTGTITTGTCATCTTTAACTTGATTTACTTATTT^ 
1 + + + + + + 60 

CTATTTTTAAGAACTGTCGACAAAACAGTAGAAATTGAACTAAATGAATAAAACAAAGA 



BsaJI 
Styl 
Mbolll 
Hinfl 
Tfil 

Nlalll Bbsl I 
MslI I TaqI I 
II II 

TATTGATGCGAATAGTTCTCTAAAAAACAAAAGCATTACCATGAAGACTTCGATTCC^ 
61 + + + + --- + + 120 

ATAACTACGCTTATCAAGAGATTTTTTGTTTTCGTAATGGT^ 



HphI Maelll Alul 

Alul I Nlalll Taal CviJi 
CviJI |MaeIIl| Tsp45I Mnll| 
Mnllj |Tsp45l| Sfcl ] Bfal || HphI 

III II I I I II I 

GGTTTTAGTTTCCTCCGTGTTAGCTTTCTCATGTCACCTACAGTCACTAGCT 

121 + + + + + + 180 

CCAAAATCTUW^GGAGGCAOATCGAAAGAGTACAGTGGATGTCAGTGATCGATTGCT 



BsaXI 

Alol 

Ppil 

EC057I 

Acll 
Mae 1 1 
Hpyl78III 
Hinfl 
Tfil 
Clal I 
TaqI I 



Msel 
Alul I 
CviJI 



ACTTTTATCACCTGATGATAGCTTTAATGGAAATATCGATTCAGGAACGTTTACTCCA 

181 + + + + + + 240 

TGAAAATAGTGGACrACTATCGAAATTACCTTTATAGCTAAGTCCTTGCAAATGAGGTTT 
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BslI 
EcoNI 
ScrFI I 

Bbsl CviJI I I 

CviJI MboII EcoRII i i 

t I III 

AACTTCAGCCACAACATATTCTCTAACAGGAGATGTCTTCTTTTACGAGCCTGGAAAAGG 

241 + + + + + +300 

TTGAAGTCGGTGTTGTATAAGAGATTGTCCTCTACAGAAGAAAATGCTCGGACCTTTTCC 



CjePI BsaJI CjePI 
HpylSBIX Taal Msel I BstDSI BslI I 

I I II I II 

CACTCCCTTATCTGACAGTTGTTTTAAGCAAACCACGGACAATCTTACCTTCTTGGGGAA 

301 + +- + + + 360 

GTGAGGGAATAGACTGTCAACAAAATTCGTTTGGTGCCTGTTAGT^TGGAAGAACCCCTT 



CviRI 
Fnu4HI 
Tsel 
Fnu4HI 
AlwNI 
BstAPI 
Mwol 
Tsel 
Aarl 
MslI 
CviRI 



Acll 
Maell 
Msel I 
Alul I I CviJI 
Taal CviJI I ISfaNI I 
I ! I 1 1 I 



Nlalll 
Bbvl 
Bbvl I 
BspMI I i 
Cac8I I 1 1 
II I I 



CGGTCATAGCTTAACGTTTGGCTTTATAGATGCTGGCACTCATGCAGGTGCTGCTGCATC 

361 + + + — + + — + 420 

GCCAGTATCGAATTGCAAACCGAAATATCTACGACCGTGAGTACGTCCACGACGACGTAG 

BseMII 

Hinfl BslI I Ddel Hinfl 

SfaNI Tfil Ddel I I BseMII I Tfil 

I I lilt! I 

TACAACAGCAAATAAGAATCTTACCTTCTCAGGGTTTTCCTTACTGAGTTTTGATTCCTC 

421 — + + + : + + + 480 

ATGTTGTCGTTTATTCTTAGAATGGAAGAGTCCCAAAAGGAATGACTCAAAACTAAGGAG 



Maelll 
Mnll I 
Bfal I Taal Sfcl 
II I I 



Mnll 
BbvCI I 
BpulOIj 
XmnI Ddel I 
I 11 



Apol 
Tsp5b9I 
BseMII I 
Mnll I I 
I M 



TCCTAGCACAACGGTTACTACAGGTCAGGGAACGCTTTCCTCAGCAGGAGGCGTAAATTT 

481 + + + + + + 540 

AGGATCGTGTTGCCAATGATGTCCAGTCCCTTGCGAAAGGAGTCGTCCTCCGCATTTAAA 
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Fig. 2 (con't) 



Bed 

Apol PstI I Hgal 

Sspl Cjel CviRI I I Alul| 

Xmnlj Tsp509I Sfcl | j | Cvi JI j 

II I I II i II 
AGAAAATATTCGTAAACTTGTAGTTGCTGGGAATTTTTCTACTGCAGATGGTGC^ 
541 + + + + + + 600 

TCTTTTATAAGCATTTGAACATCAACGACCCTTAAAAAGATGACGTCTACCACCTCGATA 

Hpyl78III Bpml 
Bsri MboII I 

cjel Msel SfaNI Maelllt j 

I I .1 II I 

CAAAGGAGCGTCTTTCCTTTTAACTGGCACTTCTGGAGATGCTCTT^ 

601 + +- --+ + + + 660 

GTTTCCTCGCAGAAAGGAAAATTGACCGTGAAGACCTCTACGAGAAAAATCATTGTTGAG 

Cac8I 

Mwol Haell I 

Mnll MunX BseRX | Hhal | | 

Earl I Tsp509I Sfcl |Cac8I | j |MwoI 

II I II I III I 

TTCATCAACAAAGGGAGGAGCAATTGCTACTAC7VGCAGGCGCTCGCATAGCAAATAA<^^ 

661 + + + + + + 720 

AAGTAGTTGTn'CCCTCCTOTTTAACGATGATGTCGTCCGCGAGCGTATCGTTTATTO 

Clal 

Hgal Hpyl78IXI TaqI 

BsaBX I MaelX | HaeXI | 

Hinfl I I MnlX I HhaX| j 

TfiX 1 I Acci I I Hin4l| 1 I 

III 11 I IN I 

AGGTTATGTTAGATTCCTATCTAACATAGCGTCrACGTCAGGAGGCGCTATCGAT^ 

721 + + + + +--- + 780 

TCCAATACAATCTAAGGATAGATTGTATCGCAGATGCyVGTCCrCCGCGATAGCTACTACT 

BsmI 
Fnu4HI 1 
CviRI I I 

TaqI ApoX Tselj j 

MaeXI I TaqI Tsp509I CjePI | | j 

I I I I I III 

AGGCACGTCGATACTATCGAACAACAAATTTCTATATTTTGAAGGGAATGCAGCGAAAAC 

781 + + + + + + 840 

TCCGTGC7U3CTATGATAGCITGri'Gl'UTAAAGATATAAAACTTCCCTTACGT 
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BsaJI 
Sty I 
Bsbl 
CjePI 
CviRI 



Bbvl 
I 



Cjel 
Dpnl 
Sau3AI 
BsrI I 
Acil| I 
II i 
II 



Alwl 
Hpyl78III 
Dpnl 
BstYl I 
Hin4I I 
Sau3AI 



Cjel BplI 
I I 



TACTGGCGGTGCGATCTGCAACACCAAGGCGAGTGGATCTCCTGAACTGATAATCTCTAA 

841 + + + + + + 900 

ATGACCGCCACGCrrAGAaSTTGTGGTTCCGCTCACCTAGAGGACTTGACTATTAGAGATT 



Dpnl 
Sau3AX I 
Hpyl88IX| I 
Plel Hinf I I 1 I 
I I 11 I 



Cjel 
MaelX I 



Acil 
Fokl I 
I I 



Bed 
Haell 
Hhal 
NlalV 
BsaHIj 
Narlj 
Banlj I 



Nlalll 
Mwol I 
I I 



CAATAAGACTCTGATCTTTGCTTCAAACGTAGCAGAAACAAGCGGTGGCGCCATCCATGC 

901 + + + + + + 

GTTATTCTGAGACTAGAAACGAAGTTTGCATCGTCTTTGTTCGCCACCGCGGTAGGTACG 



960 



CviJI 
CacSI 
Bfalj 
Alul| I 

CviJI I I j Hpyl78III Mnll Pf 111081 

Cjel Nheljl I Mnll | CviJI | Bpml| BsmAI 

I III I I I II II I 

TAAAAAGCTAGCCCTTTCCTCTGGAGGCTTTACAGAGTTTCTACGAAATAATGTCTCATC 

961 + + + + + + 1020 

ATTTTTCGATCGGGAAAGGAGACCTCCGAAATGTCTCAAAGATGCTTTATTACAGAGTAG 



BseMII 
Banll 
BsiHKAI 
Bspl286I 
Sad 

Hpyiveill Ddel| 
BSU36I I AluXll I CviRI 
BSU36I Clal Ddel | Cvi JI j | | Sfcl | 

Ddel BslI SfaNI TaqI SfaNl| j Mnll j | j BsmAI | j 

.11 I I III III I II I 

AGCAACTCCTAAGGGGGGTGCTATCAGCATCGATGCCTCAGGAGAGCTCAGTCTTTCTGC 

1021 + + + + + + 1080 

TCGTTGAGGATTCCCCCCACGATAGTCGTAGCTACGGAGTCCTCTCGAGTCAGAAAGACG 
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Fig. 2 (con't) 

BseMII Mspl BsaXI 

PstI BsaWI I Rsal | 

I I I I I 

AGAGACAGGAAACATTACCTTTGTAAG7\AATACCCTTACAACAACCGGAAGTACCGATAC 

1081 + + + + + + 1140 

TCTCTGTCCTTTGTAATGGAAACATTCTTTATGGGAATGTTGTTGGCCTTCATGGCTATG 



Tsp509I 



Fnu4HI 
Cvi JI I 
Tsell 
BscGI I I 
I II 



Sthl32I 
Apol I 

Dpnl Maelll Tsp509I | 

Maell SauSAI I Sthl32I I BscGI 1 Bbvl | 
1 II II I I M 

TCCTAAACGTAATGCGATCAACATAGGAAGTAACGGGAAATTCACGGAATTACGGGCTGC 

1141 + + + + + + 1200 

AGGATTTGCATTACGCTAGTTGTATCCTTCATTGCCCTTTAAGTGCCTTAATGCCCGACG 

Dpnl Mnll 
Sau3AI I MaeXIl 
Tsp509I Eco57I | j Hpyl88IX Hpyl88IX I I 

MboII I Alwl I I I Bed I NlalV | | I 

II II I I I I i III 

TAAAAATCATACAATTTTCTTCTATGATCCCATCACTTCAGAAGGAACCTCATCAGACGT 

1201 + + + + + + 1260 

ATTTTTAGTATGTTAAAAGAAGATACTAGGGTAGTGAAGTCTTCCTTGGAGTAGTCTGCA 

Banll 
Bcefl 
BsiHKAI 
CviJI Bspl286I 
Faul I Sad 
MboII I Alul I Hinfl 

Sthl32I| I Acil CviJI | BslI Tfil 

III I II I I 

ATTGAAGATAAATAACGGCTCTGCGGGAGCTCTCAATCCATATCAAGGAACGATTCTATT 

1261 + + + +- + + 1320 

TAACTTCTATTTATTGCCGAGACGCCCTCGAGAGTTAGGTATAGTTCCTTGCTAAGATAA 

Dral 
Msell 
MboII I I 

Hpyl78III Bpml Msel Tsp509I I I | 

I II I t I I 

TTCTGGAGAAACCCTAACAGCAGATGAACTTAAAGTTGCTGACAATTTAAAATCTTCATT 
1321 + + + + + + 1380 

AAGACCTCTTTGGGATTGTCGTCTACTTGAATTTCAACGACTGTTAT^TTTTAGAAGTAA 
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BsrI 
Cvi JI I 
Fnu4HI I I 
Tsell II 
II II 
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Hpyl78III 
Mspl 
BsaWI 
BspEI 
Mnll I 
Bbvl I I 
BsmAXI 1 
I I I 



Maelll 
Tsp45I 
Hinf 1 1 
t I 



Bce83l 
Plel I 
I t 



CACGCAGCCAGTCTCCCTATCCGGAGGAAAGTTATTGCTACAAAAGGGAGTCACTTTAGA 

1381 + + + + + + 1440 

GTGCGTCGGTCAGAGGGATAGGCCTCCTTTCT^TAACGATGTTTTCCCTCAGTGAAATCT 



Mspl 
BseRI 
BsrFI 
CviJI 
Haelll 
Hpyl78III 

Smll 
Mnll 



Hin4I 
Alul 
CviJI 
BsiHKAI I 
Bspl286I I 
BssSI I I 
I 1 I 



1441 



Hpyl78III 
Hinfl 
Mnll 
Tfil 

BsaJI Nlalll | 
BplI I BslI It I Cjel 
II I It t I 

GAGCACGAGCTTCTCTCAAGAGGCCGGTTCTCTCCTCGGCATGGATTCAGGAACGACATT 

+ + + + + + 1500 

CTCGTGCTCGAAGAGAGTTCTCCGGCCAAGAGAGGAGCCGTACCTAAGTCCTTGCTGTAA 



Alul 
CviJI 
MspAlI 
PvuII 
Sfcl I 
I I 



Hinfl 
Bfal I 
Avrlll I 
BsaJI I I 
Cjel I I I 
Hpyl78III StyljTfil 
I III 



Bsu36I 
Hinfl I 
Hindi I I 
Plel I I Ddel 
1 II ( 



ATCAACTACAGCTGGGAGTATTACAATCACGAACCTAGGAATCAATGTTGACTCCTTAGG 

1501 +- + + + + + 1560 

TAGTTGATGTCGACCCTCATAATGTTAGTGCTTGGATCCTTAGTTACAACTGAGGAATCC 



Bbvl 
CviJI 
Sthl32I 
BscGI 
CviJI 



Fnu4HI 
Msel I 
Aflllj I 
Smll I Tsell 
II II 



BstAPI Dpnl 
Mwol CjePI Sau3AI I 
t t t I 

TCTTAAGCAGCCCGTCAGCCTAACAGCAAAAGGTGCTTCAAATAAAGTGATCGTATCTGG 

1561 + + + + + + 1620 

AGAATTCGTCGGGCAGTCGGATTGTCGTTTTCCACGAAGTTTATTTCACTAGCATAGACC 
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Fig. 2 (con't) 



Nlalll 
Mboll I 

Alul CviJi 1 1 

CviJI CjePI Ndel CjePlj || 

i i I 11 11 

GAAGCTCAACCTGATTGATATTGAAGGGAACAriTATGAAAGTCATATGTTCAGCCATGA 

1621 + + + + + + 1680 

CTTCGAGTTGGACTAACrATAACTTCCCTTGTAAATACnnrrCAG 

Msel 

Acelll I Taal Hindi 

Alul Earl I I CjePI | Acll | 

CviJI Sapl I jsfaNI I I Bael Maell j 

I I I I I n I II 

CCAGCTCTTCTCTCTATTAAAAATCACGGTTGATGCTGATGTTGATACTAACGTTGACAT 

1681 +- + + + + + 1740 

GGTCGAGAAGAGAGATAATTTTTAGTGCCAACTACGACTACAACTATGATTGCAACTGTA 



Hpyl88IX 
Alwl j 
Tsp509I I ! 
Dpnl 
NlalV 
BarciHI 
BstYI 
Sau3AI 
BbvCI I 

Bae I Bpu 101 I j i j j B sa JI 

CviJI I Mnll I II III Styl 

Fnu4HI I I Bbvl | Ddel | j j j j Hinfl | 
Tsel| I iBseMII I Alwl I 11 III Tfil j 

II I 1 I I II I 

CAGCAGCCTTATCCCTGTTCCTGCTGAGGATCCTAATTCAGAATACGGATTCCAAGGACA 

1741 + + + + + + 1800 

GTCGTCGGAATAGGGACAAGGACGACrCCTAGGATTAAGTCTTATGCCTAAGGTTCCTGT 



BsaJX 
BstDSI 
CviJI I 
Haelj 
Haelll 



Avail 
Sau961 



Tsp5091 Alul 

Msel I BciVI CviJI Mnll 

II I I I • lj I 

ATGGAATGTTAATTGGACTACGGATACAGCTACAAATACAAAAGAGGCCACGGCAACTTG 

1801 + + + + + + I860 

TACCTTACAATTAACCTGATGCCTATGTCGATGTTTATGTTTTCTCCGGTGCCGTTGAAC 



Bcefl CviJI Sthl32I CviRI 

I II I 

GACCAAAACAGGATTTGTTCCCAGCCCCGAAAGAAAATCTGCGTTAGTATGCAATACCCT 

1861 + + + + + + 1920 

CTGGTTTTGTCCTAAACAAGGGTCGGGGCTTTCTTTTAGACGCAATCATACGOT 



SUBSTITUTE SHEET (RULE 26) 



wo 00/32794 



PCT/CA99/01147 



13/77 

Fig. 2(coh't) 

Alul Dpnl 
Hinfl Plel Hin4I CviRI CviJI Sau3AI | Hhal 

I II II III 

ATGGGGAGTCTTTACTGACATTCGCTCTCTGCAACAGCTTGTAGAGATCXSGCGCAACTGG 

1921 + + + + + + 1980 

TACCCCTCyiGAAATGACTGTAAGCGAGAGACGTTGTCGAACATCTCTAGCCGCGTTGACC 

Bcgl Mnll 
Bsri Tthlllll J Nlalll I CviRI 

i II I I I 

TATGGAACACAAACAAGGTTTCTGGGTTTCCTCCATGACGAACTTCCTGCATAAGAC^ 

1981 + + + + + + 2040 

ATACCTTGTGTTTGTTCCAAAGACCCAAAGGAGGTACTGCTTGAAGGAOT 

MaeXI 
BsaXI 

BsaBI Hin4I 
Bcgl I BpmX Hpyl78III Mnll 

Bsrl| 1 CviJI Mnll | CviJI | | Bpml 

Ml I I I II I I 

AQATGAAAATCGCAAAGGCTTCCGTCATACCTCTGGAGGCTACGTCATCGGTGGAAGTGC 

2041 + + ---+ + + + 2100 

TCTACTTTTAGCGTTTCCGAAGGCAGTATGGAGACCTCCGATGCAGTAGCCACCTTCACG 

BsiHKAI Bfal 
Bspl286I Bed BsmAI | 

I I I I 

TCACACTCCTAAAGACGACCTATTTACCTTTGCGTTCTGCCATCTCTTTGCTAGAGACAA 

2101 + + + + + + 2160 

AGTGTGAGGATTTCTGCTGGATAAATGGAAACGCAAGACGGTAGAGAAACGATCT^^ 

Hpyl7BIII 
Bfal I 

Xbal I I MboII 
Cjel I I I Taal | Cjel 

I III I I I 

AGATTGTTTTATCGCTCACAACAACrCTAGAACCTACGGTGGAACTTTATTCTTCA^ 

2161 + + +-- + + + 2220 

TCTAACAAAATAGCGAGTGTTGTTGAGATCTTGGATGCCACCTTGAAATAAGAAGCT 

Hinfl 
Tfil 

Earl HpyieSIX | 

Tthlllll Sapl MboII I I 

I I III 

CTCTCATACCCTACAACCCCAAAACTATTTGAGATTAGGAAGAGCAAAGTTTTCTGAATC 

2221 + + + + + + 2280 

GAGAGTATGGGATGTTGGGGTTTTGATAAACTCTAATCCTTCTCGTTTCAAAAGACTTAG 
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Apol 
Tsp509I 
Bfal 

Sf CI Avrll I 

Alul| Apol BsaJlj 
CviJlj Tsp509I Stylj 
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BslI 
Cjel 
Bsa JI I 
Styll 
Cvi JI I I 
Bfal I I I 



Fokl CviJI 



AGCTATAGAAAAATTCCCTAGGGAAATTCCCCTAGCCTTGGATGTCCAAGTTTCGTTCAG 

2281 + + - + + - + + 2340 

TCGATATCTTTTTAAGGGATCCCITTAAGGGGATCGGAACCTACT^GGT^ 



Hpyl88IX 
Hinfl 
Tfil 
Mnll| 
BsrDI CjePI I I 

11 I I II 

CCATTCAGACAACCGTATGGAAACGCACTATACCTCATTGCCAGAATCCGAAGGTTC^^ 

2341 + + + + ---+ + 2400 

GGTAAGTCrraTTGGCATACCTTTGCGTGATATGGAGTAACGGTCTTAGGCr^ 



Taal 

Hpyl88IX Cjel | 



Alul Bfal 
CviJI CviJI I Fokl 

CjePI t Haeill | CjePI | 

11 II II 

GAGCAACGAGTGTATAGCTGGTGGTATCGGCCTAGACCTTCCTTTTGTTCTTTCC;^ 

2401 + + +-[ + + + 2460 

CTCGTTGCTCACATATCGACCACCATAGCCGGATCTGGAAGGAAAACAAGAAAGGTT^^ 



CjePI 
Mmel 
Mnll I 
RleAI I 

Hpyl78IIl| t I TaqI 

II II i 
ACATCCTCTTTTCAAGACCTTCATTCCACAGATGAAAGTCGAAATGGTTTATGTATCACA 

2461 + - + + + + + 2520 

TGTAGGAGAAAAGTTCTGGAAGTAAGGTGTCTACTTTCAGCTTTACCAAATACATAGTGT 



Bfal 
Alul 
CviJI 

Alul Bcefl I 

CviJI NspV I I 
MboII I TaqI I I 
II III 



BsaJI 
BstDSI 
CviJI 
Haelll 
Bed I 
Eael I 
Gdill i 
I I 



Bbvl 



Pnu4HI 
CviJI I 
Tsel jMsel 
II I 



AAATAGCTTCTTCGAAAGCTCTAGTGATGGCCGTGGTTTTAGTATTGGAAGGCTGCTTAA 

2521 + + + + + 2580 

TTTATCGAAGAAGCTTTCGAGATCACTACCGGCACCAAAATCATAACCTTCCGACGAATT 
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Fig. 2 (con't) 



Mnll 
Bcgl 
Hinf I 
Tfil 
Hpyl78III 
TaqI 
Taqll 
RleAI I 



Bsgl 
Hinf I 
Tfil 
Bcgl I 

Apol Hpyl88IX| I 

Tsp509I CviRI ECORV | | | 

I I I II I 

CCrrCTCGATTCCTGTGGGTGCGAAATTCGTGCAGGGGGATATCGGAGATTCCTACACCTA 

2581 + + + + -+ + 2640 

GGAGAGCTAAGGACACCCACGCTTTAAGCAaSTCCCCCTATAGCCrrCTAAGGATGT^ 



Hinf I 
Tfil 
Hpyl78III I 

Dpnl I I Hpyl88IX MaellT Plel 

Sau3AI |DdeI | j BseMII | BsaBI | Sfcl| Hinf I 

i I I II I I I I II I 

TGATCrCTCAGGATTCrTTGTTTCCGATGTCTATCGTAACAATCCCCAATCTACAGC^ 

2641 + + + + + + 2700 

ACTAGAGAGTCCTAAGAAACMAGGCTACAGATAGCATTGTTAGGGGTTAGATGTCGCTG 



Apol 
Tsp509I 
BstXI 
Hinf I 
BanIZ 



B3pl2861 
CviJI I 
Plel I I 



Acil Mnll 
Thai Hpyl78III BseMII | 

I III 
TCTTGTGATGAGCCCAGACTCITGGAAAATTCGCGGTGGCAATCTTTaAC^ 

2701 + + + + + + 2760 

AGAACACTACTCGGGTCTGAGAACCTTTTAAGCGCCACCGTTAGAAAGTTCTGTCCOT 



AccI 
Maell I 



Earl 
Sap I 
Hpyl88IX 
Banll 
BsiHKAI 
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Ddel 
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TTTACTGAGGGGTAGCAACAACTACGTCTACAACTCCAATTGTGAGCrcriT 

2761 + + + + + + 2820 

AAATGACTCCCCATCGTTGTTGATGCAGATGTTGAGGTTAACACTCGAGAAGCCTGTAAT 
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Fig. 2 (con't) 
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II I I I III 

CGCTATGGAACTCCGTGGATCTTCAAGGAACTACAATGTAGATGTTGGTACCAAACTCCG 

2821 - + + + + + + 2880 

GCGATACCTTGAGGCACCTAGAAGTTCCTTGATGTTACATCTACAACCATGGTTTGAG^ 
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ATTCTTVGATTGCTAAAACTCCCTAGTTCrrCTAGGGAGTTTTCTCATACT^^ 

2881 + + + + + 2940 

TAAGATCTAACGATTTTGAGGGATCAAGAAGATCCCTCAAAAGAGTATGAAAATCCCT^ 

Sspl 
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TATTTGCTAT 

2941 + 2950 

ATAAACGATA 
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Figure 3: Nucleotide and deduced amino acid sequence of CPNl 00635 

agttaaagat gacaaaacag ctgtcaagaa tttttatctt gactctctga gttttctatt 60 
ttatatgacg caagtaagaa tttaataata aagtgggttt atg aaa teg caa ttt 115 
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Fig. 3 (con"t) 
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Fig, 3 (con't) 
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Fig.3(con't) 
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Gly Glu Lys Arg 


Lys 










650 










655 




660 






tac cgt 


cat 


aaa 


tct 


ggt 


gga 


tat 


get 


ate 


gga 


ggt gca gcg caa 


act 


2131 


Tyr Arg 


His 


Lys 


Ser 


Gly 


Gly 


Tyr 


Ala 


He 


Gly 


Gly Ala Ala Gin 


Thr 




Tyr Arg 


Kxs 


Lys 


Ser 


Gly 


Gly 


Tyr 


Ala 


He 


Gly 


Gly Ala Ala Gin 


Thr 








665 










670 






675 






tgt tct 


OaB. 


aac 


tta 


att 


age 


rtt 


gcc 


ttt 


rgc 


caa etc ttd ggt 


age 


2179 


Cys Ser 


Glu 


Asn 


Leu 


He 


Ser 


Phe 


Ala 


Phe 


Cys 


Gin Leu Phe Gly Ser 




Cys Ser 


Glu 


Asn 


Leu 


.He 


Ser 


Phe 


Ala 


Phe 


Cys 


Gin Leu Phe Gly 


Ser 






680 










685 








690 
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Fig. 3 (con't) 



gat 


aaa 


gat 


ttc 


tta 


gtc 


get 


aaa 


aat 


eat 


act gat ace 


tat 


gea 


gga 


2227 


Asp Lys 


Asp 


Phe 


Leu 


Val 


Ala 


Lys 


Asn 


His 


Thr Asp Thr Tyr Ala 


Gly 




Asp 


Lys 


Asp 


Phe 


Leu 


Val 


Ala 


Lys 


Asn 


His 


Thr Asp Thr Tyr Ala 


Gly 






695 










700 








705 










gcc 


ttc 


tat 


ate 


eaa 


cac 


att 


aca 




tgt 


agt ggg ttc 


ata 


ggt 


tgt 


2275 


Ala 


Phe 


Tyr 


lie 


Gin 


His 


He 


Thr 


Glu 


Cys 


Ser Gly Phe 


He Gly 


Cys 




Ala 


Phe 


Tyr 


lie 


Gin 


His 


He 


Thr 


Glu 


Cys 


Ser Gly Phe 


He Gly 


Cys 




710 










715 










720 






725 




etc 


tta 


gat 


aaa 


ett 


cct 


qqe 


tct 


tgg 


agt 


cat aaa ccc 


etc 


gtt 


tta 


2323 


Leu 


Leu 


Asp 


Lys 


Leu 


Pro 


Gly 


Ser 


Trp 


Ser 


His Lys Pro 


Leu 


Val 


Leu 




Leu 


Leu 


Asp 


Lys 


Leu 


Pro 


Gly 


ser 


Trp 


Ser 


His Lys Pro 


Leu 


Val 


Leu 












730 










735 






740 






gaa 


999 


cag 


etc 


qet 


tat 


age 


cac 


gtc 


agt 


aat gat etg 


aag 


aca 


aag 


2371 


Glu 


Gly 


Gin 


Leu 


Ala 


Tyr 


Ser 


His 


Val 


Ser 


Asn Asp Leu 


Lys 


Thr 


Lys 




Glu Gly 


Gin 


Leu 


Ala 


Tyr 


Ser 


His 


Val 


Ser 


Asn Asp Leu 


Lys 


Thr 


Lys 










745 










750 






755 








tat 


act 


acq 


tat 


cct 


paq 


gtg 


aaa 


ggt 


tct 


tgg ggg aat 


aat 


get 


ttt 


2419 


Tyr Thr 


Ala 


Tyr 


Pro 


Glu 


Val 


Lys 


Gly 


Ser 


Trp Gly Asn Asn Ala 


Phe 




Tyr Thr 


Ala 


Tyr 


Pro 


Glu 


Val 
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Ser 


Trp Gly Asn Asn Ala 


Phe 








760 










765 






770 










aac 


atg 


atg 


ttg 


gga 


get 


tet 


tet 


cat 


tct 


tat cct gaa 


tae 


etg 


eat 


2467 


Asn 


Met 


Met 


Leu 


Gly 


Ala 


Ser 


Ser 


His 


Ser 


Tyr Pro Glu Tyr Leu 


His 




Asn 


Met 


Met 


Leu 


Gly 


Ala 


Ser 


Ser 
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Ser 


Tyr Pro Glu 


Tyr 


Leu 


His 






775 










780 








785 










tgt 


ttt 


Qat 


acc 


tat 


qet 


cea 


tae 


ate 


aaa 


etg aat etg 


ace 


tat 


ata 


2515 


Cys 


Phe 


Asp 


Thr 


Tyr 


Ala 


Pro 


Tyr 


He 


Lys 


Leu Asn Leu Thr Tyr 


He 




Cys 


Phe 


Asp 


Thr 


Tyr 


Ala 


Pro 


Tyr 


He 


Lys 


Leu Asn Leu 


Thr 


Tyr 


He 




790 










795 










800 






805 




cgt 


cag 


gac 


aac 


ttc 


teg 


qaq 


aaa 


ggt 


aca 


gaa gga aga 


tct 


ttt 


gat 


2563 


Arg 


Gin 


Asp 


Ser 


Phe 


Ser 


Glu 


Lys 


Gly 


Thr 


Glu Gly Arg 


Ser 


Phe 


Asp 




Arg 


Gin 


Asp 


Ser 


Phe 


Ser 


Glu 


Lys 


Gly 


Thr 


Glu Gly Arg Ser 


Phe 


Asp 












810 










815 






820 






gac 


age 


aac 


etc 


ttc 


aat 


tta 


tet 


ttg 


cct 


ata egg gtg 


aag 


ttt 


gag 


2611 


Asp 


Ser 


Asn 


Leu 


Phe 


Asn 


Leu 


Ser 


Leu 


Pro 


He Gly Val 


Lys 


Phe 


Glu 




Asp 


Ser 


Asn 


Leu 


Phe 


Asn 


Leu 


Ser 


Leu 


Pro 


He Gly Val 


Lys 


Phe 


Glu 










825 










830 






835 








aag 


ttc 


tct 


gat 


tgt 


aat 


gac 


ttt 


tct 


tat 


gat erg act 


tta 


tec 


tat 


2659 


Lys 


Phe 


Ser 


Asp 


Cys 


Asn 


Asp 


Phe 


Ser 


Tyr 


Asp Leu Thr 


Leu 


Ser 


Tyr 




Lys 


Phe 


Ser 


Asp 


Cys 


Asn 


Asp 


Phe 


Ser 


Tyr 


Asp Leu Thr 


Leu 


Ser 


Tyr 








840 










84 5 






850 










gtt 


ccc 


gat 


cct 


ate 


cgc 


aac 


gac 


ccc 


aaa 


cgc acc aca 


cca 


etc 


cca 


2707 


Val 


Pro 


Asp 


Leu 


He 


Arg 


Asn 


Asp 


Pro 


Lys 


Cys Thr Thr 


Ala 


Leu 


Val 




Val 


Pro 


Asp 


Leu 


He 


Arg 
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Asp 


Pro 


Lys 


Cys Thr Thr 


Ala 


Leu 


val 
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Fig. 3 (con't) 



ate 


age 


gga 


gee 


tct 


tgg 


gaa 


act 


tat 
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aac tta gca cga 


cag 


2755 


lie 


Ser 


Gly 


Ala 


Ser 


Trp 


Glu 


Thr 


Tyr 


Ala 


Asn 


Asn Leu Ala Arg Gin 




lie 


Ser 


Gly 


Ala 


Ser 
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Glu 


Thr 


Tyr 


Ala 


Asn 


Asn Leu Ala Arg Gin 




870 










875 










880 




885 




gcc 


ttg 


caa 


gtg 


cgt 


gca 


ggc 


agt 


cac 


tac 


gcc 


ttc tct ect atg 


ttt 


2803 


Ala 


Leu 


Gin 


Val 


Arg 


Ala 


Gly 


Ser 


His 


Tyr 


Ala 


Phe Ser Pro Met 


Phe 




Ala 


Leu 


Gin 


Val 


Arg 


Ala 


Gly 


Ser 


His 


Tyr 


Ala 


Phe Ser Pro Met 


Phe 












890 










895 




900 






gaa 


gtg 


etc 


ggc 


eag 


ttt 


gte 


ttt 


gaa 


gtt 


cgt 


gga tec tea egg 


att 


2B51 


Glu 


Val 


Leu 


Gly 


Gin 


Phe 


Val 


Phe 


Glu 


Val 


Arg 


Gly Ser Ser Arg 


He 




Glu 


Val 


Leu 


Gly 


Gin 


Phe 


Val 


Phe 


Glu 


Val 


Arg 


Gly Ser Ser Arg 


He 










905 










910 






915 






tat 


aat 


gta 


gat 


ett 


ggg 


ggt 


aag 


ttc 


caa 


ttc 


taggagcgtc tctcatgtct 


2904 


Tyr 


Asn 


Val 


Asp 


Leu 


Gly 


Gly 


Lys 


Phe 


Gin 


Phe 








Tyr Asn 


Val 


Asp 
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Gly 


Gly 


Lys 
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Gin 


Phe 
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cagaaattet gagagagate geatttagga ttttcttaaa cacgac 2 950 
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Figure 4: Restriction enzyme analysis of GPN100635 



Apol 
Tsp509I 
Hpyl78III 
Alul 
CviJI 
MspAlI 
PvuII 



Ddel 
Hpyl88IX 
Hinfl 
Hpyl78III I 
BseNIll I 

Msel PvuII 1 I Plel I ' I 

I III I li I 

AGTTAAAGATGACAAAACyv^GCTGTCAAGAATTTTTATCTTGACTCrrCTGAGTT^ 
+ + + + + + 

TCAATTTCTACTGTTTTGTCGACAGTTCITAAAAATAGAACT^ 



60 



ApoI 
Tsp509I 
Hgal I Msel 



ScrPI 
EcoRII I 



Tsp509I 



TTATATGACGCAAGTAAGAATTTAATAATAAAGTGGGTTTATGAAATCGCy^TTTTCCTC 
51 + + + + + 

AATATACTGCGTT<^TTCTTAAATTATTATrrCACCCAAATACr^ 



Mwol 
CviRI 

BsiHKAI Pnu4HI I 

Bspl2 861 Earl Nlall I Bf al TspRI | | 

MboII I TaqI j MslI Nspl Spel | Bbvl Taal |Tsel| | 

I III I I II I I I III 

GTTAGTGCrrCTCnn'CGACATTGGCATGTTTTACTAGTTGTTCCy^ 

121 + + -h + + + 180 

OUVTCACGAGAGAAGCTGTAACCGTACAAAATGATCyVACAAGGTGAC^^ 



Hpyl88IX 
NlalV 
CviJI I 
Haelll I 
EcoO109l| I 
Sau96I I I 
III 



Mnll 
Alul I 
CviJI 



Rsal 
Seal 
Tat I I 
I I 



NlalV 
Bani |BsaXI 
I I I 



TGCTGAAAATATAGGCCCCTCTGATAGCTTTGACGGAAGTACTAACACAGGCACCTATAC 

181 + + + + + - + 240 

ACGACTTTTATATCCGGGGAGACTATCGAAACTGCCTTCATGATTGTGTCCGTGGATATG 



Cjel 



BsrI 



Cjel 
Hpyl88IX 



CviRI 



TCCTAAAAATACGACTACTGGAATAGACTATACTCTGACAGGAGATATAACTCTGCAAAA 

241 + + + + + + 300 

AGGATTTTTATGCTGATGACCTTATCTGATATGAGACTGTCCTCTATATTGAGACGT^ 



SUBSTITUTE SHEET (RULE 26) 



wo 00/32794 



PCT/CA99/01I47 



Fig. 4 (con't) 



24/77 



BsaJI 
Styl 
I 



Hinfl 
Tfil 
I 



Msel 
Alul 
CviJI 
Fnu4HI I 
Tsell I 
II I 



Bbvl 



Hpyl88IX 



Hinfl 
Tfil 



Alul 
CviJI 
Hindlll 1 
Msel I I 
I i I 



CCTTGGGGAITCGGCAGCTTTAACGAAGGGTTGTTTTTCTGACTICTACGGAA 
301 + + + + + + 

GGAACCCCrrAAGCCGTCGA?ATTGCrTCCCAACAAAAAGACTGTGATGCCTTAGA;^ 



360 



Pnu4HI 
Hhal I 

Mspl Dral Msel Tselj 

BsrFl| Bsll Rsal ' Msel|SspI | Bfal Mwol || 

II I I II II I III 

CrrTTGCCGGTAAGGGGTACTCACTTTCTTTTTTAAATATT 

361 + + + + + + 

GAAACGGCCaiTTCCCCa^TGAGTGAAAGAAAAAATTTATAATTCAGATCAaSACOT 



420 



EC057I 

Bbvl| Hinfl 
CviJI Maelllli BsaBI TaqI [ Plel 

I ill I Ml 

AGCCCiU-rLTiGTTACAACTGATAAAAATCTGTCGCTAACAGGA'ri^'CGAGTCTTACTTO 

421 + + + + + + 480 

TCGGGAAAGACAATGTTGACTATTTTTAGACAGCGATTGTCCTAAAAGCTCAGAATC 



Bed 
UlalV 1 
CviJI I I 
Haelllj 
Sau96I I 
Fnu4HI I 
Taul 
Acilt 



Cjel 
Ddel| 



I Bsll 



Mnll 

Hpyl78III Msel 
Cjel I BseMII I 

BSU36I 1 I CviRl| I 

Ddel I I Mnll I I I 



CTTAGCGGCCCCATCATCGGTAATCACAACCCCCTCAGGAAAAGGTGCAGTTAAATGTGG 

481 + + + + + + 540 

GAATCGCCGGGGTAGTAGCCATTAGTGTTGGGGGAGTCCTTTTCCACGTCAATTTACACC 



Bsgl 
Dpnl 
BstYI I 
Sau3AI 



541 



Dral Taal 
Alwl Msel I Mnll | 

Ml II II 
AGGGGATCTTACATTTGATAACAATGGAACTATTTTATTTAAACAAGATTAGTGTGAGGA 
^ ^ ^ + + ^ goo 

TCCCCTAGaATGTAAACTATTGTTACCTTGATAAAATAAATTTGTTCTAATGACACTC 
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601 



Clal 

CviJI TaqI 
Nlaiv) Dpnl| 
Ecil II Hinf I Sau3AI | | 

Acil| 11 Tfil Sthl32I BscGI | || Alwl 

Mil I I II II I 

AAATGGCGGAGCCATTTCTACCAAGAATCTTTCTTTGAAAAACTVGCACGGGA 

+ + - + + + + 660 

TTTACCGCCTCGGTAAAGATGGTTCTTAGAAAGAAACTTTTTGTCGTGCCCTAGCTAAAG 



661 



Hhal 

CjePlj CjePI 
TaqI I I CviJI Mwol | 

I II III 
TTTTGAAGGGAATAAATCGAGCGCAAC7«3GGAAAAAAGGTGGGGCTATT^ 

+. + + + + + 

AAAACTTCCCITATTTAGCTCGCGTTGTCCCTTTTTTCCACCCCGA^ 



720 



Taal 
Sfcl 
BsrI I 
Rsal 



MboII 
Nlaivj 
CviJI I I 

II 



BspMI 
Sspl 
Bbvl 
Mnll 



Earl 
Hpyl78III 
TaqI 
Bcefl I 
I 



Fnu4HI 
Sfcl 
Alul| 
CviJI I 
Mwol I 
Tselj 
II 



TACTGTAGATATTACAAATAATACGGCTCCTACCCTCTTCTCGAACAATATTGCTGAAGC 

721 + + + + -:--- + + 

ATGACATCtATAATGTTTATTATGCCGAGGATGGGAGAAGAGCTTGTTATAACGACITCG 



780 



EC057I 

Alul I 

CviJI I Tsp509I 

Mwol I Rsal 

Aarl I ^ BsrGI | 

Pat 1 1 I Taal j j BsmAI 

CviRI I I 1 Tat I I I Maell BsmBI 

I II I II 

TGCAGGTGGAGCTATTVAATAGCACAGGAAACTGTACAATTACAGGGAATACGTCTCTTGT 

781 -I- + + + + + 840 

ACGTCCACCTCGATATTTATCGTGTCCTTTGACATGTTAATGTCCCTTATGCAGAGAACA 



HpylBBIX 



Maelll 
Tsp45I 



Acil 
BsiEI Mnll 



BseRI 
Hpyl78III 
Banll 
BsiHKAI 
Bspl286I 
Sad 
SfaNI 
Alul I 
CviJI I j jcjel 

I I 



ATTTTCTGAAAATAGTGTGACAGCGACCGCAGGAAATGGAGGAGCTCTTTCTGGAGATGC 

841 -- + -- + + + + 

TAAAAGACTTTTATCACACTGTCGCTGGCGTCCTTTACCTCCTCGAGAAAGACCTCTACG 



900 
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Fig. 4 (con't) 



Maelll 



I 



Maelll 
HpylSBIX 
Hinfl I 
BslI I I 
Bpml [Tfil I 

I I I I 



901 



Hpyl78III 
iCjel Ddel I 
I I I 

CGATGTTACCyVTATCTGGGAATCAGAGTGTAACTTTCTCAGGAAACCAAGCTGTAGCTAA 

+ -T + + + + + 

GCTACJ^TGGTATAGACCCTTAGTCTCACATTGAAAGAGTCCTTTGGTTCGACATCGATT 



BseNII 
Sfcl 
Alul| Alul 
Cvi JI I Cvi JI 
II I 



960 



Neil 
ScrFI 
BsaJI 
Mspl 
BsrI 
TspRI 
CviJI 
Sthl32I 
Alul 

CviJI CviJI 
Nlaivl Hindlll | 

Ecil I I Ddel II I I I I I ^ Hinfl 

Acil| II Mwol I II I Mil BslI Tfil 

11 II I I II 

TGGCGGAGCCATTTATGCTAAGAAGCTTACACTGGCTTCCGGGGGGGGGGGGGGGAATC 

951 + + + + + 1020 

ACaSCCTCGGTAAATACGATTCTTCGAATGTGACCGAAGGCCCCCCCCCCCCCCCrrAGG 



PstI 
TspRI 
CviRI 
HgiEII 
BtsI 
SfCl 
Kpnl 
NlalV 



Rsal 
BanI 
BspMI 
BsaJI I 
Styl I 
Cjell I 
II I 



CviJI 
NlalVj 
M 



Xcml 
Cjel I 



CTTTTCTAACAATATAGTCCAAGGTACCACTGCAGGTAATGGTGGAGCCATTTCTATACT 

1021 + + + + + --- + 1080 

GAAAAGATTGTTATATCAGGTTCCATGGTGACGTCCATTACCACCTCGGTAAAGATATGA 
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Alul 
CviJI 
MspAlI 
PvuII 
Bsri I 
Fnu4HI I 
Tsel| I 
II I 



Bbvl 



Bpml 
I 



NlalV 
Avail) 
Sau96X| 
II 



BsmFl 
Msel I 
Xcmlj I 
II I 



GGCAGCTGGAGAGTGTAGTCriTTTCAGCGAAGCAGGGGACCa^TTACCrTAATGC^^ 



1081 



-+ 1140 



CCGTCGACCTCrCACATCyVGAAAAGTCGCnrCGTCCCCTGGTAATGGAATTACCCITA 

Dpnl 
BstYI 

CviRI Sau3AI 
Mwol I i^ol HaelV | | Bcgl 

BsmI I Tsp509I Hin4I j j Alwl | 

III I I II II 

CATTGTTGCAACTACACCACAAACrACAAAAAGAAATTCTATTC^ 

1141 + + + + + + 1200 

GTAACAACGTTGATGTGGTGTTTGATGTTTTTCTTTAAGATAA 



Tsp509I 
Hpyl78III 
Dpnl j 
Sau3AI 
BsrI I 



CviRI 
BsaAI I 
Maell I I 



Bcgl 



HpylBBIX 
Dpnl 
Sau3AI 
PflllOSI j 
SfaNI 
Alwl I 1 



Bbvl 



CAAAGATCACGAATTACGTGCAATATCTGGGCATAGCATCrrr^ 

1201 + +- + + + + 1260 

GTTTCTAGTGCTTAATGCTVCGTTATAGACCCGTATCGTAGAAAAAGATGCrrAGGCTAATC 



Hinfl 
Tfil 
Acil 
Fnu4HI 
CviJI I 

Tselj I Beef I Dral 
Mwol I 1 I Sf cl I Msel I 
I II 

TGCTAATACGGCTGCGGATTCTACAGATACTTTAAATCTCAATAAGGCTGATGCAGGTAA 

1261 + + + + + + 1320 

ACGATTATGCCGACGCCTAAGATGTCTATGAAATTTAGAGTTATTCCGACTACGTCCATT 



CviJI 
BspMI I 
SfaNI I I CviRI 



HpylBBIX 
Hphl| 

Rsal TaqI Alul | | 

TatI I SimI I CviJI | j 

II I I I II 

TAGTACAGATTATAGTGGGTCGATTGTTTTTTCTGGTGAAAAGCTCTCTGAAGATGAAGC 

1321 + + + --- + + + 1380 

ATCATGTCTAATATCACCCAGCTAACAAAAAAGACCACTTTrCGAGAGACTTCTACTTCG 
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EC057I 
CviRll 
BstAPI I I 
MWOI I I 
Mboll I I I 



Maelll 
CviJI 
Pnu4HI I 
Mwol ( I 
Mnll Tselj I 

till I Ml I I II I 

AAAAGTTGCyiGACAACCTCACTTCTACGCTGAAGCAGCCTGTAACTCTAACTGCAGGAAA 

13B1 + + + + + + 1440 

TTTTCyU^CGTCTGTTGGAGTGAAGATGCGACrrCGTCGGACATTGAGAlTGACGTC^^ 



ApoX 
Tsp509I 
PstI 
CviRI 
Sfcl I 
Eco57l| I 
Bbvl I I I 



BseMII 
NlalV 
SacII 
Acil 
MspAlI 
Thai 
Acil 
BsaJI 
BstDSI 

Maell Hpyl88IX 
Msel I Faul j 

Rsal I I Hpyl78III Ddel 

Seal I iMaelll j Sthl32I 

TatI I I |Tsp45I Taqi CviJI ( 

11 I I I I II 

TTTAGTACTTAAACGTGGTGTCACTCTCGATACGAAAGGCrTTACTC^ 

1441 + + + + + + 1500 

AAATCATGAATTTGOVCCSVCAGTGAGAGCTATGCTTTCCGAAAtGAG 



Faul 
Mnll 
Sthl32l| 
SfaNI I I 

I II 



Msel 
Acll 
Fokl 
Maell 
Bspl286I 
Bmgl 
BseSI 
Cac8I I 
Acil I I 
i I I 



Mnll 
Rsal 
TatI I 
Mnll I I 
II I 



Maelll 
Tsp45I 
I 



BseRI 
Msel I 



CTCrrGTTATTATGGATGCGGGCACAACGTTAAAAGCAAGTACAGAGGAGGTCACTTTJ^ 

1501 + + + + + + 1560 

GAGACaATAATACCTACGCCCGTGTTGCAATTTTCGTTCATGTCrCCTCCAGTGA;^ 



Hinf I 
AccI I 
Sfcl I I 
Plel I I Mnll 



Fnu4HI Sfcl 
Bbvl Tsp509I Tsel| Mwol | 



AGGTCTTTCCyiTTCCrGTAGACTCTTTAGGCGAGGGTAAGAAAGTTGTAATTGCrrGCn^ 

1561 + + ---+ + + + 1620 

TCCAGAAAGGTAAGGACATCTGAGAAATCCGCTCCCATTCTTTCAACATTAACGACGAAG 
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Hinfl 

PstI Tfil 

Pnu4Hl| Hpyl88IX 

CviRlll Avail I 

Mwol|| Ddel MboII | | BsaJI 

Tseljl Bbvl CviJI |Sau96I j | Styl 

III I I I I II I 

TGCAGCAAGTAA/^TGTAGCCCTTAGTGGTCCGATTCTTCTTTTGGATAACCAAGGGAA 

1621 + + + + + + 1680 

ACGTCGTTCATTTTTACATCGGGAATCACCAGGCTAAGAAGAAAACCTATTGG 



Alul 
CviJI 

Ddel Fnu4HI | 

Bce83I I Hpyl78III CviRI | | 

BsmI Hpyl78III | Smll | Tselj | 

I I I II II I 

TGCTTATGAAAATCACGACTTAGGAAAAACTCAAGACrrTTTeAT^^ 

1681 + + ---+ + + + 1740 

AOSAATACrTTTAGTGCTGAATCCTTTTTGAGTTCTGAAAAGTi^ 



CviRI 
Acelll Bsgl I 
Bbvl I RsaX| j 



Sfcl 



Sfcl 
NlalV I 
Acil I I Cjel 
MspAlI I |TaaI I 



TCTGGGTACTGCAACAACTACAGATGTTCCAGCGGTTCCTACAGTAGCAACTCCTACGCA 

1741 + + + + + + 1800 

AGACCCATGACGTTGTTGATGTCTACAAGGTCGCCAAGGATGTCATCGTTGAGGATG 



Cjel 



CacSI 
Acil I 



Rsal I 

II I i 

CTATGGGTATCAAGGTACTTGGGGAATGACTTGGGTTGATGATACCGCAAGCACTCCAAA 

1801 + + + + + + 1860 

GATACCCATAGTTCCATGAACCCCTTACTGAACCCAACTACTATGGCGTTCGTGAGGTTT 



Tthlllll 
Ddel I 
II 



Avail 
Sau96I 
Alul I 
CviJI I 
Mwol I I 



BpulOI 
Ddel 
Hpyl78III 
Hinfl 
Tfil 
Hgal 



I I I 



Hpyl88IX 
BseMII 
CviJI I 

I 1 



GACTAAGACAGCGACATTAGCTTGGACCAATACAGGCTACCTTCCGAATCCTGAGCGTCA 

1861 ^^---+ + + + + + 1920 

CTGATTCTGTCGCTGTAATCGAACCTGGTTATGTCCGATGGAAGGCTTAGGACTCGCAGT 
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Avail 
EC00109I 

PspSII 

Sau96X 
Sse8647I Alol 



30/77 



Dpnl 
BstYI I 
Fokl |Hpyl88IX 
CviJI Sau3AI I Alwl | 



Hinfl 
Tfil 



AGGACCrnTAGTTCCTAATAGCCrrTTGGGGATCTTTTTCAGAC^ 

1921 : + + + + + + 1980 

TCCTGGAAATCAAGGATTATCGGAAACCCCTAGAAAAAGTCTGTAGGTTCGCTAAGTTCC 



Mwol 
Hinfl 

Bbvl Acil 
TaqI Fnu4HI 
Dpnl I CviJI I 

Sau3AI I I Faul I I 

HpylBBIXl II Sthl32l| || 
Plel Hinfl Mnll| | | | CviJI ||Tsel| 

I I Ml II I II 11 

TGTCATAGAGAGAAGTGCTTTGACTCITTGTTCAGATCGAGGCTTCrG^ 

1981 + + + + + + 2040 

ACAGTATCrrCTCTTCACGAAACTGAGAAACAAGTCTAGCTCCGAAGACCCGACGCCCTCA 



Plel 

Tsp509l| Ddel Taal 

II I I 

CGCCAATTTCTTAGATAAAGATAAGAAAGGGGAAAAACGCAAATACCGTCATAAATCTGG 



2041 



2100 



GCGGTTAAAGAATCrrATTTCTATTCnOTCCCCTTTTTGCGTTTATGGC^ 



HpylseiX 

Mnll I Fnu4HI 

Haeivj I CviRl| 

Hin4l| I TseI|HhaI 
I II 



T9P509I 
BsgX I 
HpylSSIX I I Alnl 

Bbvl 1 I Mselj CviJI 
I I I II I 



TGGATATGCTATCGGAGGTGCAGCGCAAACTTGTTCTGAAAACTTAATTAGCTTTGCCTT 



2101 



-+ 2160 



ACCTATACGATAGCCTCCACGTCGCGTTTGAACAAGACTTTTGAATTAATCGAAACGGAA 



BstXI 



Ddel 



TTGCCAACTCTTTGGTAGCGATAAAGATTTCTTAGTCGCTAAAAATCATACTGATACCTA 



2161 



-+ 2220 



AACGGTTGAGAAACGATCGCTATTTCTAAAGAATCAGCGATTTTTAGTATGACTATGGAT 
CviJI 

NlalV I BsmAI 
CviRI I Bsbl Mmel Ddel| 

i II I I II 

TGCAGGAGCCTTCTATATCCAACACATTACAGAATGTAGTGGGTTCATAGGTTGTCTCTT 

2221 + + + + + + 2280 

ACGTCCTCGGAAGATATAGGTTGTGTAATGTCTTACATCACCCAAGTATCCAACAGAGAA 



SUBSTITUTE SHEET (RULE 26) 



wo 00/32794 



PCT/CA99/0n47 



31/77 



Fig. 4 (con't) 



CacBI 
Alul 

CviJI CviJI 
ScrFI I Fnu4HI | 

EcoRII I I Hinfl Plel Mnll Tsel | | 

I I I I I I II I 

AGATAAACTTCCTGGCTCTTGGAGTCATAAACCCCrCGTTTTAGAAGGGCJ^ 

2281 :+ + + + + + 2340 

TCTATTTGJ^GGACCGAGAACCTCT^GTATTTGGGGAGCAAAATCTTCCCGTCG^ 



BSU36I 
Ddel 
Hpyl78III 
Ecd57I 
BseMII 
MboII 



Mae 1 1 
CviJI I 
Acelll I 1 
Bbvl I ( I 

II 



HpylBBIX BstZ17l| 
Dpnl I AccI 1 I 

SauSAI I t Bbsl III I Mnll I j BciVI 

Mi I III I I 

TAGCCACGTCAGTAATGATCTGAAGACy^GTATACTGCXSTATCCTGAGGTGAAAGGTTC 

2341 + + + + + + 2400 

ATOKTrcCAGTCATTACrAGACTTCrGTTTCATATGACGC^ 



MboII Alul 

HphI Msel Nlalll j CviJI Hpyl78III 

I I I I I I 

TTGGGGGAATAATGCTTTTAACATGATGTTGGGAGCTTCTTCTCATTCa^ 



2401 



-+ 2460 



AACCCCCTTATTACGAAAATTGTACTACAACCCTCGAAGAAGAGTAAGAATAGGACTTAT 
BspMI 



Hpyl88IX 
Hinfl I Hpyl78III 
CvlRI I BsaXI Tfil I Maell | 

II I II II 

CCTGCATTGTTTTGATACCTATGCTCCATACATCAAACTGAATCTGACGTATATACGl^ 

2461 + + + --+ + + 2520 

GGACGTAACAAAACTATGGATACGAGGTATGTAGTTTGACTTAGACTGGATATATGCAGT 



Hpyl88IX 
Alul I 
CviJI 1 
Hin4I 



Dpnl 
Bglll I 
BstYI I 
Sau3AI i 



MboII 
MboII I 



Earl 
Tsp509l| 



I Rsal 

I I I 

GGACAGCTTCTCGGAGAAAGGTACAGAAGGAAGATCTTTTGATGACAGCAACCTCT^ 

2521 + + + + + + 2580 

CCTGTCGAAGAGCCTCTTTCCATGTCTTCCTTCTAGAAAACTACTGTCGTTG^ 

Hpyl88IX 
Hin4I I 

Mnll Sfcl HphI I I BplI 

II III I 

TTTATCTTTGCCTATAGGGGTGAAGTTTGAGAAGTTCTCTGATTGTAATGACTT^^ 

2581 + + + + + + 2640 

AAATAGAAACGGATATCCCCACTTCAAACTCTTCAAGAGACTAACATTACTGAAAAGAAT 
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BsrDI 
Dpnl I 

Hpyl88IX Jypnl Sau3AI t | 

Dpnl I Sau3AI I Alwl | | j Sf cl 

Sau3AI I I Hpyl78IIl| | Acil| | jj CviRI | 

III III II I II II 

TGATCTGACITTATCCTATGTTCCTCATCTTATCCGCAATGATCCCTV^ 

2641 + + + + + + 2700 

ACTAGACraAAATAGGATAaU^GGACrAGAATAGGCGTTACTAGGGTTTACGTGATGTC^ 



CviJI 

CviJI Hael 
NlalVl Haelll 
Acil II StuI 
MspAlI I I Mnll Ddel Mwol | 

I II I I II 

ACTTGTAATCAGCGGAGCCTCTTGGGAAACTTATGCCAATAACTTAGCy^CGACAGG 

2701 + + + + + + 2760 

TGAACATIAGTCGCCTCGGAGAACCCTTTGAATACGGTTATTGAATCGTGCTGTCCGGAA 



Maelll 
Tsp45I 
CacBI I 
CviRI I I 
CviRI CacSI 1 



Bsgl 
Tthlllll I 



Bsri 
CviJI 
Hael II 
Eael 
Gdill 
BsiHKAll 
Bspl286l| 



GCAAGTGCGTGCAGGCAGTCACTACGCCTTCTCTCCTATGTTTGAAGTGCTCGGCCAGTT 

2761 + + + + --+ 2820 

CGTTCACGCACGTCCGTCAGTGATGCGGAAGAGAGGATACAAACTTCACGAGCCGGTCAA 



Dpnl 
NlalV 
BatnHI 
BstYI 
SauBAI 
Alwl I 

I I 



Alwl Mnll 



Dpnl 
Bglll I 
BstYI I 
SauBAI 



Hgal 
Tsp509I 



TGTCriUXjAAGTTCGTGGATCCrCACGGATTTATAATGTAGATCTTGGGGGTAAGTTCCA 

2821 + + + + + 2880 

ACAGAAACTTCAAGCACCTAGGAGTGCCTAAATATTACATCTAGAACCCCCATTCAAGGT 
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Fig. 4 (coni) 



Dpnl 
Sau3AI 
BseMII 
Ddel 
Hpyl88IX 
BplI 
Apol 
Tsp509I 
BsmAI 
Hpyl88IX| 
BseMII I 
Ddell 
Nlalll 
BsmAI I 
Bfal BsmBI j 

BslI Hin4X I I I I I I I I I III Msel 

I III 
ATTCTAGGAGOSTCrrcrCATGTCTCAGAAATTCTGAGAGAGATCGCAl^ 

2881 + + + + + + 2940 

TAAGATCCTCGCAGAGAGTACAGAGTCrTTAAGACTCTCrCTAGCGTAAATCCTAAAAGA 



TAAACACGAC 

2941 + 2950 

ATTTGTGCTG 
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Figure 5: Nucleotide and deduced amino acid sequence of CPN100638 



tgtagattct taacttacta gtctctcctt tcctcttgct ttctttraatt tattgcagta 60 • 

tgtggtgaaa taatttgtta aaccacctat agccctctac atg aaa tec tct ctt 115 

Met Lys Ser Ser Leu 
1 5 

cat tgg ttt tta ate teg tea tct tta gca ett eee ttg tea eta aat 163 
His Trp Phe Leu He Ser Ser Ser Leu Ala Leu Pro Leu Ser Leu Asn 
10 15 20 

ttc tet geg ttt get get gtt gtt gaa ate aat eta gga cet ace aat 211 
Phe Ser Ala Phe Ala Ala Val Val Glu He Asn Leu Gly Pro Thr Asn 
25 30 35 

age ttc tct gga eca gga ace tae act cct cca gcc caa aca aca aat 259 
Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro Ala Gin Thr Thr Asn 
40 45 50 

gca gat gga act ate tat aat eta aca ggg gat gtc tea ate ace aat 307 
Ala Asp Gly Thr He Tyr Asn Leu Thr Gly Asp Val Ser He Thr Asn 
55 60 65 

gca gga tet eeg aca get eta acc get tec tgc ttt aaa gaa act act 355 
Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys Phe Lys Glu Thr Thr 
70 75 80 85 

ggg aat ctt tct ttc caa gge cac ggc tac caa ttt etc eta caa aat 403 
Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin Phe Leu Leu Gin Asn 
90 95 100 

ate gat geg gga geg aac tgt acc ttt acc aat aca get gca aat aag 451 
He Asp Ala Gly Ala Asn- Cys Thr Phe Thr Asn Thr Ala Ala Asn Lys 
105 110 115 

ett etc tec ttt tea gga ttc tec tat ttg tea eta ata caa ace aeg 499 
Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser Leu He Gin Thr Thr 
120 125 130 

aat get acc aca gga aca gga gcc ate aag tec aca gga get tgt tet 547 
Asn Ala Thr Thr Gly Thr Gly Ala He Lys Ser Thr Gly Ala Cys Ser 
135 140 145 

att eag teg aac tat agt tgc tac ttt ggc caa aac ttt tct aat gac 595 
He Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin Asn Phe Ser Asn Asp 
150 155 " 160 165 

aat gga ggc gcc etc caa ggc age cct a::c age cca teg eta aac cec 643 
Asn Gly Gly Ala Leu Gin Gly Ser Ser lie Ser Leu Ser Leu Asn Pro 
170 175 180 

aac eta acg ttt gee aaa aac aaa gca acg caa aaa ggg act gcc etc 691 
Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin Lys Gly Gly Ala Leu 
185 190 195 
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tat tec acg gga ggg att aca att aac aat acg tta aac tea gca tea 739 
Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr Leu Asn Ser Ala Ser 
200 205 210 

ttt tct gaa aat ace gcg gcg aac aat ggc gga gcc att tac acg gaa 787 
Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly Ala lie Tyr Thr Glu 
215 220 225 

get age agt ttt att age age aac aaa gca att age ttt ata aac aat 835 
Ala Ser Ser Phe lie Ser Ser Asn Lys Ala He Ser Phe lie Asn Asn 
230 235 240 245 

agt gtg ace gca acc tea get aca ggg gga gee att tac tgt agt agt 883 
Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala He Tyr Cys Ser Ser 
250 255 260 

aca tea gcc ccc aaa cca gtc tta act eta tea gac aac ggg gaa ctg 931 
Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser Asp Asn Gly Glu Leu 
265 270 275 

aac ttt ata gga aat aca gca att act agt ggt ggg gcg att tat act 979 
Asn Phe He Gly Asn Thr Ala lie Thr Ser Gly Gly Ala He Tyr Thr 
280 285 290 

gac aat eta gtt ctt tct tct gga gga ect acg ett ttt aaa aac aac 1027 
Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr Leu Phe Lys Asn Asn 
295 300 305 

tct ggc tat gat act gca get ccc tta gga gga gca att gcg att get 1075 
Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly Ala He Ala He Ala 
310 315 320 325 

gac tct gga tct ttg agt ctt teg get ett ggt gga gac ate act ttt 1123 
Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly Gly Asp lie Thr Phe 
330 335 340 

gaa gga aac aca gta gte aaa gga get tct teg agt eag ace act acc 1171 
Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser Ser Gin Thr Thr Thr 
345 350 355 

aga aat tct att aac ate gga aac acc aat get aag att gta cag ctg 1219 
Arg Asn Ser He Asn He Gly Asn Thr Asn Ala Lys He Val Gin Leu 
360 365 370 

cga gcc tct caa ggc aat act ate tac tte tat gat cct ata aca act 1267 
Arg Ala Ser Gin Gly Asn Thr He Tyr Phe Tyr Asp Pro He Thr Thr 
375 380 385 

age ate ac:: cca get etc tea gat get eta aac tta aat ggt cct gac 1315 
Ser He Thr Ala Ala Leu Ser Asp Ala Leu Asn Leu Asn Gly Pro .-.sp 
390 395 400 405 

ctt gca ggg aat cct gca tat caa gga acc ate gta ttt tct gga gag i363 
Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr He Val Phe Ser Gly Glu 
410 ^ 415 420 
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aag etc teg gaa gca gaa get gca gaa get gat aat ete aaa tet aca 1411 
Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp Asn Leu Lys Sex Thr 
425 430 435 

att eag caa eet eta act ctt gcg gga ggg caa etc tet ctt aaa tea 1459 
lie Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin Leu Ser Leu Lys Ser 
440 445 450 

gga gte act eta gtt get aag tec ttt teg caa tet, ccg ggc tet ace 1507 
Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin Ser Pro Gly Ser Thr 
455 460 465 

etc etc atg gat gca ggg ace aca tta gaa acc get gat ggg ate act 1555 
Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr Ala Asp Gly lie Thr 
470 475 480 485 

ate aat aat ctt gtt etc aat gta gat tec tta aaa gag acc aag aag 1603 
He Asn Asn Leu Val Leu Asn Val Asp Ser Leu Lys Glu Thr Lys Lys 
490 495 500 

ggc acg eta aaa gca aca caa gca agt eag aca gte act tta tet gga 1651 
Gly Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr Val Thr Leu Ser Gly 
505 510 515 

teg etc tet ctt gta gat eet tet gga aat gtc tac gaa gat gte tet 1699 
Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val Tvr Glu Asp Val Ser 
520 525 530 

tgg aat aac eet caa gtc ttt tet tgt ete act ctt act get gac gae 1747 
Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr Leu Thr Ala Asp Asp 
535 540 545 

cec gcg aat att eac ate aca gae tta get get gat eec eta gaa aaa 1795 
Pro Ala Asn He His He Thr Asp Leu Ala Ala Asp Pro Leu Glu Lys 
550 555 560 565 

aat eet ate cat tgg gga tac eaa ggg aat tgg gca tta tet tgg caa 184 3 
Asn Pro He His Trp Gly Tyr Gin Gly Asn Trp Ala Leu Ser Trp Gin 
570 575 580 

gag gat act gcg act aaa tec aaa gca gcg act ctt acc tgg aca aaa 1891 
Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr Leu Thr Trp Thr Lys 
585 590 595 

aca gga tac aat ccg aat cct gag cgt cgt gga acc tta gtt get aac 1939 
Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly Thr Leu Val Ala Asn 
600 605 610 

acg eta tgg gga tec ttt gtt gat gtg cgc tec aza caa eag ctt gta 1987 
Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser I.e Gin Gin Leu Val 
615 620 625 

gee act aaa gta cgc caa tet caa gaa act cgc cgc ate tgg tqt gaa 203 5 
Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg Gly He Trp Cys Glu 
630 635 640 645 
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ggg ate teg aac ttc ttc cat aaa gat age acg aag ata aat aaa ggt 2083 
Gly He Ser Asn Phe Phe His Lys Asp Ser Thr Lys He Asn Lys Gly 
650 655 660 

ttt ege cac ata agt gca ggt tat gtt gta gga gcg act aca aca tta 2131 
Phe Arg His He Ser Ala Gly Tyr Val Val Gly Ala Thr Thr Thr Leu 
665 670 675 

get tot gat aat ctt ate act gea gee ttc tgc caa tta tte ggg aaa 2179 
Ala Ser Asp Asn Leu He Thr Ala Ala Phe Cys Gin Leu Phe Gly Lys 
680 685 690 

gat aga gat cae ttt ata aat aaa aat aga get tct gee tat gca get 2227 
Asp Arg Asp His Phe He Asn Lys Asn Arg Ala Ser Ala Tyr Ala Ala 
695 700 705 

tet etc cat etc cag eat eta gcg ace ttg tct tct eca age ttg tta 2275 
Ser Liau His Leu Gin His Leu Ala Thr Leu Ser Ser Pro Ser Leu Leu 
710 715 720 725 

cgc tae ctt ect gga tct gaa agt gag cag cct gtc etc ttt gat get 2323 
Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro Val Leu Phe Asp Ala 
730 735 740 

cag ate age tat ate tat agt aaa aat act atg aaa ace tat tae ace 2371 
Gin He Ser Tyr He Tyr Ser Lys Asn Thr Met Lys Thr Tyr Tyr Thr 
745 750 755 

caa gca eca aag gga gag age teg tgg tat aat gac ggt tgc get ctg 2419 
Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn Asp Gly Cys Ala Leu 
760 765 770 

gaa ctt gcg age tec eta eca cae act get tta age cat gag ggt etc 2467 
Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu Ser His Glu Gly Leu 
775 780 785 

ttc cac gcg tat ttt cet ttc ate aaa gta gaa get teg tac ata cae 2 515 
Phe His Ala Tyr Phe Pro Phe He Lys Val Glu Ala Ser Tyr He His 
790 795 800 805 

eaa gat age ttc aaa gaa egt aat act ace ttg gta cga tct ttc gat 2563 
Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu Val Arg Ser Phe Asp 
810 815 820 

age ggt gat tta att aac gtc tct gtg cct att gga att ace ttc gag 2611 
Ser Gly Asp Leu He Asn Val Ser Val Pro He Gly He Thr Phe Glu 
825 830 835 

aga ttc teg aga aac gag cgt gcg tct tac gaa get act gtc ate tac 2659 
Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu .-.la Thr Val He Tyr 
840 845 850 

gtt gee gat gtc tat egt aag aat cet gac tgc acg aca get etc eta 2707 
Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys Thr Thr Ala Leu Leu 
855 860 B65 
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Fig. 5 (con't) 

ate aac aat acc teg tgg aaa act aca gga acg aat etc tea aga eaa 2755 
lie Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr Asn Leu Ser Arg Gin 
870 875 880 885 

get ggt ate gga aga gca ggg ate ttt tat gee ttc tct cca aat ett 2803 
Ala Gly lie Gly Arg Ala Gly He Phe Tyr Ala Phe Ser Pro Asn Leu 
890 895 900 

gag gtc aca agt aac eta tct atg gaa att cgt gga tet tea cgc age 2851 
Glu Val Thr Ser Asn Leu Ser Met Glu He Arg Gly Ser Ser Arg Ser 
905 910 915 

tac aat gea gat ctt gga ggt aag ttc cag ttc taaaagcgtt cctgatccct 2904 
Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 

tagaaattct aagagatcct gagtgtatct agggacttct caaaga 2950 
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Figure 6: Restriction enzyme analysis of CPNl 00638 



Tsp509I 

Hinfl Bfal Msel | 

Tfil Msel Spelj BsmAI Mnll || CviRI 

I I 11 I I 11 I 
TGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAATTTATTGC^ 
1 + + + + + + 60 

ACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAAAGAAATTAAATAACGTCAT 



Msel 

Tsp509I HphI I 



Mboll 

CviJI Mnll Mnll 

Sfcl i Nlalllj Earl I 

II i i II II 
TGTGGTGAAATAATTTGTTAAACCACCTATAGCCCTCrACATGAAATCCTCrcrr^ 
61 + + + + + + 120 

ACACCACTTTATTAAACAATTTGGTGGATATCGGGAGATGTACnTTAGGAGAG^ 



Bbvl 
ApoX I 
Tsp509I I Fnu4HI 
Maelll I I Mwol| 

Msel Tsp45I 1 I Tselj 

I I I i II 

GTTTTTAATCTCGTCATCriTAGCACTTCCCITGTCACT 

121 + + + + + + 

CAAAAATTAGAGCAGTAGAAATCGTGAAGGGAACAGTGATTTAAAGAGACGCAAAC^ 



180 



NlalV 
BseRX 
Bptnl 



Avail 
ECOO109I 
PspBII 
Sau96I 
Sse8647I 
Bfal I 



ScrFI 
EcoRII 
Avail 
BspGI 
Sau96I 
Hpyl78IIl| 
Alul I I 

CviJI I I 



|BsaXI 
lAlolj 
|Ppil| 
II 

TGTTGTTGAAATCAATCTAGGACCTACCAATAGCTTCTCTGGACCAGGAACCTACACTCC 

181 + + + '+ + + 240 

ACAACAACTTTAGTTAGATCCTGGATGGTTATCGAAGAGACCTGGTCCTTGGATC 



Bed 

Mnll Tthlllllj 
CviJI I CviRI I I HphI BsmAI 

II I M II 

TCCAGCCCAAACAACAAATGCAGATGGAACTATCTATAATCTAACAGGGGATGTCTCAAT 

241 + + + + + + 300 

AGGTCGdGTTTGTTGTTTACGTCTACCTTGATAGATATTAGATTGTCCCCTACAGAGTTA 
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Alul 
CviJI 
Alwl 
HpylBSIX 
Dpnl 
I 



BsrI 
Hinfl 
Tfil 
Bmrl I 



BstYI I II I Acelll Dral 
Sau3AI I II I AcxI ( Msel| 
Fokl CviRI II II I Mwol I I Mmel | | 

I I II II I II I III 

CACCAATGCAGGATCTCCGACAGCTCTAACCGCnTCCTGCTTTAAAGAAACTACTCGGAA 

301 + + + + + + 

GTGGTTACGTCCTAGAGGCTGTCGAGATTGGCGAAGGACGAAATTTCTTTGATGACCCTT 



360 



BsaJI 
BstDSI 
CviJI 
Hael 
Haelll 
BsaJI j 
Styl 



Clal 
TaqI 
Faul 
Sthl32l| 

.Tsp509I Sf aNI | j 

CviJI I Beef I I III Acil 

I II I I II III I 

TCTTTCTTTCCAAGGCCACGGCTACCAATTTCTCCTACAAAATATCGATGCGGGAGCGAA 

361 + + + + + + 420 

AGAAAGAAAGGTTCCGGTGCCGATGGTTAAAGAGGATGTTTTATAGCTACGCCCTCGCTT 



Alul 
CviJI 
Hindi! I 
CviRI 
Fnu4HI 
Alul I 
CviJI I 

Bbvl MspAllj I j I Hinfl 

Rsal I Pvullj I I 1 Tfil Maelll 

Taal I I Tselj | | | Hpyl78III j Tsp45I 

11 I II 
CTGTACCTTTACCAATACAGCTGCAAATAAGCTTCTCTCCTTTTCAGGATTCTCCTATTT 

421 + + ~- + + + + 480 

GACATGGAAATGGTTATGTCGACGTTTATTCGAAGAGAGGAAAAGTCCTAAGAGGATAAA 



Bed 
CviJI I 
NlaIV| I 



Alul 
CviJI 



BsmI 

i II I I 

GTCACTAATACAAACCACGAATGCTACCACAGGAACAGGAGCCATCAAGTCCACAGGAGC 

481 + + + --- + + + 540 

CAGTGATTATGTTTGGTGCTTACGATGGTGTCCTTGTCCTCGGTAGTTCAGGTGTCCTCG 
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Fig. 6 (con't) 



CviJI 
Hael 
Haelll 
Msci 
Eael I 

MWOI I I 

TaqI Sfcl CjePI | | | Mnll 

II I III I 

TTGTTCTATTCAGTCGAACTATAGTTGCTACrTTGGCCAATUVCTTTTCTA^ 

541 + + + + — + 600 

AACTU^GATAAGTCAGCTTGATATCAACGATGAAACCGGTTTTGAAAAGATTACTC 



BsaJI 
Hael I 
Hhal 



Hin4I 
NlalV 
BsaHI I 
Nan I 
Bani I j 
CjePI I I 
III 



601 



Alul 
CviJI 

Mnll I Acelll 
Fnu4HI I I Bbvl I Acll 

Styl TseX| I I BplI I I CjePI MaelX 

I MM I II I I 

AGGCGCCCTCCAAGGCAGCTCTATCAGTCTATCGCTAAACCCCAACCTAACGTTTGCCAA 

+ + + + + + SSO 

TCCGCGGGAGGTTCCGTCGAGATAGTCAGATAGCGATTTGGGGTTGGATTGCAAACGGTT 



Sthl32I 
Bspl286I 
Bmgl 
BseSI 
NlalVl 
BanI I I 
I II 



BslI 
BscGI 
Mnll 
BsaJI I 
BstDSI I 
Mnll I I 
II I 



MseX 
Tsp509I I 



AAACAAAGCAACGCAAAAAGGGGGTGCCCTCTATTCCACGGGAGGGATTACAATTAACAA 

661 + + + + + + 720 

TTTGTTTCGTTGCGTTTTTCCCCCACGGGAGATAAGGTGCCCTCCCTAATGTTAATTGT^ 



Fnu4HI 
SacII 
Taul 
Acil| 
MspAlI I 

Thai I CviJI 
BseMII Acil I I Nlalv) 
Hpyl88IX BsaJI j | Ecil | j 

S£aNl| BstDSI II Acil| || 

II I II II II 

TACGTTAAACTCAGCATCATTTTCTGAAAATACCGCGGCGAACAATGGCGGAGCCATTTA 

721 ^ + + + + --_-+ 780 

ATGCAATTTGAGTCGTAGTAAAAGACTTTTATGGCGCCGCTTGTTACCGCCTCGGTAAAT 



Msel 
Maell I Ddel 

I I I 
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CacSI 
Bfal 
Alul I 

CviJI I I Fnu4HI Bbvl Alul Maelll 

Nhel I I Tsel| Tsp509I CviJI Tsp45I 

III II I I I 
CACGGAAGCTAGCAGTTTTATTAGCAGCAACAAAGCAATTAGCTTTATAAACAATAGTGT 
781 + + + + + + 840 

GTGCCTTCGATCGTCAAAATAATCGTCGTTGTTTCGTTAATCGAAATATTTGTTATCACA 



CviJI 
NlalV 
BseMll 
Mnll 
Cjel 
Sfcl 
Alul 
CviJI 
Mwol 
BbvCI I 

BpulOlj II II II Taal Rsal Cjel 

Acil Ddel I j I II II Sfcl I TatI | CviJI | BsrI 

I II II II II II II II I 

GACCGCAACCTCAGCTACAGGGGGAGCCATTTACTGTAGTAGTACATCAGCCCCCAAACC 

841 + + + + + + 

CTGGCGTTGGAGTCGATGTCCCCCTCGGTAAATGACATCATCATGTAGTCGGGGGTTTGG 



900 



Hpyl88IX T6p509I Bfal 

Msel Sthl32I I BscGI Cjel | Spelj 

III I till 

AGTCTTAACTCTATCAGACAACGGGGAACTCAACTTTATAGGAAATACAGCAATTACTAG 

901 -r + ---+ + + + 960 

TGAGAATTGAGATAGTCTCTTGCCCCTTGACTTGAAATATCCTTTATGTCGTTAATGATC 



MboII 
Cjel Bfal I 
I 



Avail 
EC00109I 
PspSII 
Sau96I 
Sse8647I 
Hpyl78III I 
Mnll I I 
I i I 



Bpml 
Dral I 
Msel I I 



TGGTGGGGCGATTTATACTGACAATCTAGTTCa^lTC'rrCTGGAGGACCTACGCTTTTTA^ 

961 + + + + ---+ + 1020 

ACCACCCCGCTAAATATGACTGTTAGATCAAGAAAGAAGACCTCCTGGATGCGAAAAATT 
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Fig. 6 (con't) 



Muni 
Tsp509I 
Acelll 
BSU36I 
Ddel 
Mnll 



Alul 
CviJi 
PstI 
BsaXI I 
Fnu4HI I 
CviRI I I 
Tsel I I 
Sfcl III 



1 Bbvl I 

CviJI Sfcl I I I 1 I |Bsll| 

I 

AAACAACTCrrGGCTATGATACTGCAGCTCCCnrrAGGAGGAGCAATTGCGATTGCTGACT 

1021 + + + + + + 1080 

TTTGTTGAGACCGATACTATGACGTCGAGGGAATCCTCCTCGTTAACGCTAACGACTGAG 



Haeiv 
BseRI Hin4I 
PleljHinfl j 



Dpnl 

BstYI I BsmAI 

Sau3AI I Alwl CviJI I 

Hpyl78IIl| I Hinfl Plel j j Taal 

ill I I I I I 

TGGATCTTTGAGTCTTTCGGCrCTTGGTGGAGACATCACrT^^ 

1081 + + + + + + 1140 

ACCTAGAAACrCAGAAAGCCGAGAACCACCTCTGTAGTGAAAACTTCCTTTGTC 

Alul HpyiaSIX 
CviJI Hinfl j Apol Hpyl88IX 

MboII I TaqI I I Plel Tsp509I Msel | 

I I II I I 1 I I 

CAAAGGAGCTTCTTCGAGTCAGACCACTACCAGAAATTCTATTAACATCGGAAACACCAA 

1141 + + + + + + 1200 

GTTTCCTCGAAGAAGCTCAGTCTGGTGATGGTCrTTAAGATAATTGTAGCCTTTGTGGTT 



CviJI 
Cac8I 
Fnu4HI 
Alul I 



CviJI I 
MspAllj 
Pvullj 
Bce83I 1 I 
Rsal I I 
Bbvl BsrGI I II 
Ddel TatI |Tsel| 

I I I II 



Dpnl 
Sau3AI I 
Alwl I 



Smll Mnll 

II III 
TGCTAAGATTGTACAGCTGCGAGCCTCTCAAGGCAATACTATCTACTTCTATGATCCTAT 

1201 + + --- - + + + + 1260 

ACGATTCTAACATGTCGACGCTCGGAGAGTTCCGTTATGATAGATGAAGATACTAGGATA 
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B£aX 



Acelll 
Bbvl 
Hpyl88IX 
Alul 



CviJI 
PstI 
SfaNI 
TapRI 
Fnu4HI 
SfaNI 
CviRll 
Tsel I 
BtsI I I 
Sf cl I I 
II 



Ddel i 
I I 



Hpyl78III 
Msel Avail | 
BseMII I Sau96I | CviRI 
I I I I I 



AACy^CTAGCATCACTGCAGCTCTCTCAGATGCTCTAAACTTAAATGGTCCrrGACC^^ 

1261 + +- + + ---+ + X320 

TTGTTGATCGTAGTGACGTCGAGAGAGTCTACGAGATTTGAATTTACCAGGACTGGAACG 



HpylBSIX 
Bbvl 
Alul 

Bed CviJI 
Hinfl Drdll | Hin4l| 

Tfil CviRI NlalVl I HpylTBIII Hin4I | | | Bpml 

I I Ml I Ml 

AGGGAATCCTGCATATCAAGGAACCATCGTATTTTCTGGAGAGAAGCTCTCGGAAGCAGA 

1321 + + + + + + 1380 

TCCCTTAGGACGTATAGTTCCTTGGTAGCATAAAAGACCTCTCTTCGAGAGCCTTCGTCT 



Alul 
AlwNI 
CviJI 
Mwol 
PstI 
CviRI 
BplI 
Fnu4HI 
Sfcl 
Alul) 
AlwNI I 
Bpllj 
BstAPI I 

CviJI I I I I I Acil 
Mwolj II II Faul Mnll 

Tsellll II Tsp509I Sthl32l|Mnll| 

I II II 

AGCTGCAGAAGCTGATAATCTCAAATCTACAATTCAGCAACCTCTAACTCTTGCGGGAGG 

1381 + + + + + + 1440 

TCGACGTCTTCGACTATTAGAGTTTAGATGTTAAGTCGTTGGAGATTGAGAACGCCCTCC 
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Maelll 

Tsp45I CviJI 
Hinfl( BseRI 
BplI I I Neil I 

Hpyl78IIl||| Bfal ScrFI | 

Hin4I Msel Plel Ddel Sthl32I Mspl | | 

I I MM II I Ml 

GCAACTCTCrCTTATU^TCAGGAGTCACTCTAGTTGCTAAGTCCTTTTCGCAATCTCC 

1441 + + + + + + 1500 

CGTTGAGAGAGAATTTAGTCCTCAGTGAGATCAACGATTCAGGAAAAGCGTTAGAGGCCC 



Fokl 
Cjel 
NlalV 
Avail 



SfaNI 
Banll I 
Bspl2Q6I I 



Sau96I 
CviRI 
Mnll 
Mnll I 
Nlallll I 
BslI II I 
I M I 



Alwl 
BsaBI 
Dpnl 

MspAlI Sau3AI I 
Acil I Bed I I 
BsraFI 1 |Bsll| I I 
I 11 II II 



CrcrrACCCTCCTCATGGATGCAGGGACCACATTAGAAACCGCTGATGGGATCACTATCAA 

1501 + + + + + + 1560 

GAGATGGGAGGAGTACCTACGTCCCTGGTGTAATCTTTGGCGACTACCCTAGTGATAGTT 



Bspl286I 

Bsal CacSI 
Hinf I BsmAI Bmgl | 

Cjel Tfil Msel BseSi | Bsbl 

I II III 

TAATCTTGTTCTCAATGTAGATTCCrrTAAAAGAGACCAAGAAGGGCACG 

1561 + + - + + + + 1620 

ATTAGAACAAGAGTTACATCrAAGGAATTTTCTCrGGTTCTTCCCGTGC 



Tthlllll 
Maelll 
Taal 
Tsp45I 
Hpyl88IX I 
Mwol Cjel I I 

II I 



Hpyl78III 
Dpnl 
BstYI 



Dpnl 
Sau3AI I 
HpylTBIIlj I 
II I 

ACAAGCAAGTCAGACAGTCACTTTATCTGGATCGCTCTCTCTTGTAGATCCTTCTGGAM 

1621 + + + + + + 1680 

TGTTCGTTCAGTCTGTCAGTGAAATAGACCTAGCGAGAGAGAACATCTAGGAAGACCTTT 



SauBAI 
Cjel I 
Alwl I I 
Alwl I I I 
I III 



PflllOSI MboII 
AccI I BsmAI CjePr 

CjePl| I Bce83I 1 Srall [Mnll BsmAI 

Ml II III I 

TGTCTACGAAGATGTCTCTTGGAATAACCCTCAAGTCTTTTCTTGTCTCACTCTTACTGC 

1681 + + + + + + 1740 

ACAGATGCTTCTACAGAGAACCTTATTGGGAGTTCAGAAAAGAACAGAGTGAGAATGACG 
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1741 



Dpnl 
Sau3AI 
Fnu4HI 

Faul Alul I 

Sspl Alwl I 

Sthl32l| CviJlj 
Thai I I Tsel j 

SimI Acil I [| Bbvl Ddel [j 1 | Bfal 

I i I II I I II I I I 

TGACGACCCCGCGAATATTCACATCACAGACTTAGCTGCTGATCCCCTAGAAAAAAATCC 

+ + + + + + 1800 

ACTGCTGGGGCGCTTATAAGTGTAGTGTCrGAATCGACGACTAGGGGATCTTTTTTTAGG 



Tsp509I 

BsaJI I BciVI 

BciVI Styl I Mnll | 

I II II 

TATCCATTGGGGATACCAAGGGAATTGGGCATTATCTTGGCAAGAGGATACrrGCGACTAA 

1801 + + + + + + 1860 

ATAGGTAACCCCTATGGTTCCCTTAACCCGTAATAGAACCGTTCTCCTATGACGCTGATT 



Hinf I 
Fnu4HI I 

Tsel 
Plel 
II 



BspGI 
ScrFI I 
Bbvl I I 
ECORII I I BciVI 
III I 



BpulOI 
Ddel 
Hpyl78III 
Hinf I 
Tf il 
Hgal I 
Hpyl88IX I 
BseMII I I 



ATCCAAAGCMCGACTCTTACCTGGACAAAAACAGGATACAATCCGAATCOTGAGOT 

1861 - — + + + + + + 1920 

TAGGTTTCGTCGCTGAGAATGGACCTGTTTTTGTCCTATGTTAGGCTTAGGACTCGCAGC 



Dpnl 
NlalV 
BamHI 

Cjel BstYI 
Ddel I Sau3AI 
NlalV I I Alwl I 

I II II 



Hhal 
Hin4I I 
Alwl Cjel I j 
I III 



TGGAACCTTAGTTGCTAACACGCTATGGGGATCCrTTGTTGATGTGCGCTCCATACAAC^ 

i921 + + --- + + ---- + + 1980 

ACCTTGGAATCAACGATTGTGCGATACCCCTAGGAAACAACTACACGCGAGGTATGTTGT 

Fnu4HI Dpnl 

Taul BstYI I 

Alul Bce83I Hpyl78III Acil | Sau3AI j 

CviJI CviJI I Rsal Smll | Thai | SfatJI | | 

till II II III 

GCTTGTAGCCACTAAAGTACGCCAATCTCAAGAAACTCGCGGCATCTGGTGTGAAGGGAT 

1981 + + + -- + + + 2040 

CGAACATCGGTGATTTCATGCGGTTAGAGTTCTTTGAGCGCCGTAGACCACACTTCCCTA 
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Fig. 6 (con't) 



Xnml 
Alwl 
Hpyl78Iir I 

Taqi I I MslI 
MboIl| I I MboII BspMI ICviRI 

Mil I I I I 

CTCGAACTTCTTCCATAAAGATAGCACGAAGATAAATAAAGGTTTTCGCCACATAAGTGC 

2041 + + + + + + 2100 

GAGCTTGAAGAAGGTATTTCTATCGTGCTTCTATTTATTTCCAAAAGCGGTGTAT^^ 



CviJI 
PstI 
TspRI 
Pnu4HI 
CviRll 

Hpyl88IX Tselj 
Alul I BtsI I I 

Bsgl CviiJI I Sf cl I I 

I II I II 

AGGTTATGTTGTAGGAGCGACTACAACATTAGCTTCTGATAATCTTATCACTGCAGCCTT 

2101 --- - + + + + + + 2160 

TCCAATAC7UVCATCCTCGCTGATGTTGTAATCGAAGACTATTAGAATAGTGACGTCGGAA 



Bbvl Dpnl 
Tsp509I Sau3AI | Alul 

Sthl32l|Hpyl78III BsaBI | | CviJI 

II I II I I 

CTGCCyVATTATTCGGGAAAGATAGAGATCACTTTATAAATAAAAATAGAGC^ 

2161 + + + + + + 2220 

GACGGTTAATAAGCCCTTTCTATCTCTAGTGAAATATTTATTTT^ 



Alul 
CviJI 
Pnu4HI 
Bpml I 
CviRlj 
Mwol I 
Tselj 
II 



Tthllll 
SfaNI 
BbBl| 

Bbvl MboII I 

Bed Bf al I I 

I I II 

TGCAGCTTCrCTCCATCTCCAGCATCTAGCGACCrTGTCTTCTCC^ 

2221 + + + + --- + + 2280 

ACGTCGAAGAGAGGTAGAGGTCGTAGATCGCTGGAACAGAAGAGGTTCGAACAATGCGAT 



Mwol 
Maelll 
Alul I 
CviJI I 
Hindi I I I I 
I I I 



Hpyl88IX 
Dpnl 
BstYI 
Sau3AI 
ScrFI I 
EcoRII 1 



SfaNI 
CviJI I 
Fnu4HI I I 
Alwl Tsel| I I 
I II I I 



Sfcl 
BseMII 
Alul 
CviJI 
Dpnl 



Bbvl 



Sau3AI 
Hpyl88IX| 
Ddel I I 
Mnlll It 
II II 



CCTTCCTGGATCTGAAAGTGAGCAGCCTGTCCTCTTTGATGCTCAGATCAGCTATATCTA 

2281 + + + + + + 2340 

GGAAGGACCTAGACTTTCACTCGTCGGACAGGAGAAACTACGAGTCTAGTCGATATAGAT 



SUBSTrrUTE SHEET (RULE 26) 



wo 00/32794 



PCT/CA99/01147 



48/77 



Fig. 6 (con't) 



Banll 
BsiHKAI 
Bspl286I 
Sad 
BssSI I 
Alul I 
Cvi JI I 
Tthlllll I I 
CjePI Taqll | | | 
I I I M 

TAGTAAAAATACTATGAAAACCTATTACACCCAAGCACCAAAGGGAGAGAGCTCGTGGTA 



2341 



-+ 2400 



ATCATTTTTATGATACTTTTGGATAATGTGGGTTCGTGGTTTCCCTCTCTCGAGCACC^ 



2401 



Banll Nlalll 

BsiHKAI MbOlI 

Bspl286I CviJI 

Sad Mnll 

Alul I Msel I 

CviJI I TspRlj j 
CacBI t I Btsi I I I 

III rill 

TAATGACGGTTGCGCrCTGGAACTTGCGAGCTCCCTACCACACACrGCTT^ 

+ + --+ + + + 2460 

ATTACTGCCAACGCGAGACCTTGAACGCTCGAGGGATGGTGTGTGACGAAATTCGGTACr 



Hpyl78III 
Hhal I 
CjePI I I 
Taal I I I 
till 



Thai 
Afllll 
Mlul 
Earl I 
Bsal I I 
SimI BsmAI | j 
I I II 



Alul 
CviJI 
Hindi I I I 



Rsal 



I I 



GGGTCTCTTCCACGCGTATTTTCCTTTCATCAAAGTAGAAGCTTCGTACATACACCAAGA 

2461 +-- + + + + + 

CCCAGAGAAGGTGCGCATAAAAGGAAAGTAGTTTCATCTTCGAAGCATGTATGTGGTTCT 



2520 



Mae II 
HphI j 
Msel 

Dpnl Pad 
Alul BsaJI Sau3AI | Tsp509I | 

CviJI Maell Styl Rsal | | TaqI Acil Msel | j 

I I I I I I I I III 

TAGCTTCA/^GAACGTAATACTACCTTGGTACGATCTTTCGATAGCGGTGATTTAATTAA 

2521 + + + + + + 2580 

ATCGAAGTTTCTTGCATTATGATGGAACCATGCTAGAAAGCTATCGCCACTAAATTAATT 
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Hgal 
Hpyl78III 
HpyI78III 
TaqI 
Aval I 
Smllj 
Hin4I I I 
Hinfl I I I I Bpll 
BsmAI Hpyl78III |Xhol| j j Cac8l| 

BstnBI Tsp509I TaqI |TfiI Mil Hin4l| 

I I I I I III I II 

CGTCTCTGTGCCTATTGGAATTACCTTCGAGAGATTCTCGAGAAACGAGCGTGCGTCTTA 

2581 + + + + + + 2640 

GCAGAGACACGGATAACCTTAATGGAAGCrCTCTAAGAGCTCTTTGCTCGCACGCAGAAT 



Hpyl78III 

Alul Bsgl Hinfl j Alul 

CviJI Taal Maell BsaBI | Tfil j CviRI CviJI 

II I I I I I I I 

CGAAGCTACTGTCATCTACGTTGCCGATGTCTATCGTAAGAATCCTGACTGCACGACAGC 

2641 + + -- + + + + 2700 

GCTTCXSATGACAGTAGATGCAACGGCTACAGATAGCATTCTTAGGACTGACGTGCTGTCG 

Bce83I Hpyl78III 
Sfcl| Hinfl I Alul 

Acelll BssSI Mnll j | Tfil Smll j CviJI 

I I III I I I I 

TCTCCTAATCAACAATACCTCGTGGAAAACTACAGGAACGAATCTCTCAAGACAAGCTGG 

2701 +- + + + + + 2760 

AGAGGATTAGTTGTTATGGAGCACCTTTTGATGTCCTTGCrTAGAGAGTTCrGTT^^ 



MboII Maelll 

Dpnlj Tsp45I 
Hpyl88IX BstYI | | Hpyl78III 

Earl I Sau3AI j j Smll| 

Sapl I Hin4I I I I Alwl Mnll j j Maelll 

II II II I III 

TATCGGAAGAGCAGGGATCTTTTATGCCTTCTCTCCAAATCTTGAGGTCACAAGTAACCT 

2761 + + + + + + 2820 

ATAGCCTTCrCGTCCCTAGAAAATACGGAAGAGAGGTTTAGAACTCCAGTGTTCATTGGA 



Dpnl 
BstYI 
Sau3AI 
MboII 
i^ol 



Tsp509I 
BslI I 
Bce83l I I 

I 



Alul 
CviJI 
Fnu4HI I 
Tsel| I 
Alwl I I I 
I II I 



Dpnl 
Mnll 
Bgllll 
BstYI t 
Sau3AI I 
Bbvlj I 
CviRI I I I 
INI 



BsrI 



ATCTATGGAAATTCGTGGATCTTCACGCAGCTACAATGCAGATCTTGGAGGTAAGTTCCA 

2821 + + + + + 2880 

TAGATACCTTTAAGCACCTAGAAGTGCGTCGATGTTACGTCTAGAACCTCCATTCAAGGT 
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Fig. 6 (con't) 



Ddel 
Hpyl78III 
Dpnl 
BstYl 
Sau3AI 
Dpnl BseMII 
Sau3AI 1 Apol I 

Hpyl78IIl| I Tsp509I Alwl | 
Alwl II I Ddel I Ddel I I I I Bfal 

I II I I I II 
GTTCTAAAAGCGTTCCTGATCCCTTAGAAATTCTAAGAGATCCTGAGTGTATCTAGGGAC 

2881 + + + + + + 2940 

CTAGATTTTCGCAAGGACTAGGGAATCTTTAAGATTCTCTAGGACTCACATAGATCC^^ 



BsmFI 



TTCTCAAAGA 

2941 + 2950 

AAGAGTTTCT 
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Figure 7: Nucleotide and amino acid sequence of CPN 100639 

gtacgaagtt cttcacgaaa ttataataca aacctaggct ctaagttttg tttctagatt 60 

atcgaaaacg tgttaattaa ttgaacccaa gcatctttct atg aaa ata ccc ttg 115 

Met Lys He Pro Leu 
1 5 

cac aaa etc ctg ate tct teg act ett gte act ccc att eta ttg age 163 
His Lys Leu Leu He Ser Ser Thr Leu Val Thr Pro He Leu Leu Ser 
10 15 20 

att gea act tac gga gca gat get tct tta tec cct aca gat age ttt 211 
He Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser Pro Thr Asp Ser Phe 
25 30 35 

gat gga gcg ggc gge tct aca ttt act eca aaa tct aca gca gat gee 259 
Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys Ser Thr Ala Asp Ala 
40 45 50 

aat gga acg aac tat gtc tta tea gga aat gte tat ata aac gat get 307 
Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val Tyr He Asn Asp Ala 
55 60 65 

ggg aaa ggc aca gea tta aca ggc tgc tgc ttt aca gaa act acg ggt 355 
Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe Thr Glu Thr Thr Gly 
70 75 80 85 

gat ctg aca ttt act gga aag gga tac tea ttt tea ttc aac acg gta 403 
Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe Ser Phe Asn Thr Val 
90 95 100 

gat gcg ggt teg aat gca gga get gcg gea age aca act get gat aaa 451 
Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser Thr Thr Ala Asp Lys 
105 110 115 

gee eta ate ttc aca gga ttt tet aac ctt tec ttc att gca get cct 499 
Ala Leu He Phe Thr Gly Phe Ser Asn Leu Ser Phe He Ala Ala Pro 
120 125 130 

gga act ae"a gtt get tea gga aaa agt act tta agt tct gca gga gcc 547 
Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu Ser Ser Ala Gly Ala 
135 140 145 

tta aat ctt aec gat aat gga acg att etc ttt age caa aac gtc tec 595 
Leu Asn Leu Thr Asp Asn Gly Thr He Leu Phe Ser Gin Asn Val Ser 
150 155 160 165 

aat gaa get aat aac aat ggc gga gcg ate ace aca aaa act ctt tct 64 3 
Asn Glu Ala Asn Asn Asn Gly Gly Ala He Thr Thr Lys Thr Leu Ser 
170 175 180 

att tet ggg aat ace tct tec ata ace ttc act agt aat age gca ciact o91 
He Ser Gly Asn Thr Ser Ser He Thr Phe Thr Ser Asn Ser Ala Lys 
185 190 195 
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aaa tta ggt gga gcg ate tat age tct geg get gca agt att tea gga 739 
Lys Leu Gly Gly Ala lie Tyr Ser Ser Ala Ala Ala Ser lie Ser Gly 
200 * 205 210 

aac aee ggc cag tta gtc ttt atg aat aat aaa gga gaa aet ggg ggt 787* 
Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys Gly Glu Thr Gly Gly 
215 220 225 

ggg get ctg ggc ttt gaa gee age tec teg att aet caa aat age tee 835 
Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser lie Thr Gin Asn Ser Ser 
230 235 240 245 

ett tte ttc tct gga aac act gea aea gat get gea ggc aag ggc ggg 883 
Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala Ala Gly Lys Gly Gly 
250 255 260 

gee att tat tgt gaa aaa aea gga gag aet cet act ctt aet ate tet 931 
Ala lie Tyr Cys Glu Lys Thr Gly Glu Thr Pro Thr Leu Thr lie Ser 
265 270 275 

gga aat aaa agt ctg ace ttc gee gag aac tct tea gta act caa ggc 979 
Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser Ser Val Thr Gin Gly 
280 285 290 

gga gca ate tgt gee cat ggt eta gat ctt tee get get ggc eet ace 1027 
Gly Ala lie Cys Ala His Gly Leu Asp Leu Ser Ala Ala Gly Pro Thr 
295 300 305 

eta ttt tea aat aat aga tgc ggg aac aea get gea ggc aag ggc ggc 1075 
Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala Ala Gly Lys Gly Gly 
310 315 320 325 

get att gea att gee gac tct gga tet tta agt etc tct gea aat caa 1123 
Ala lie Ala lie Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Asn Gin 
330 335 340 

gga gac ate aeg tte ctt ggc aac act eta ace tea ace tec geg eca 1171 
Gly Asp lie Thr Phe Leu Gly Asn Thr Leu Thr Ser Thr Ser Ala Pro 
345 350 355 

aea teg aea egg aat get ate tac etg gga teg tea gca aaa att aeg 1219 
Thr Ser Thr Arg Asn Ala lie Tyr Leu Gly Ser Ser Ala Lys lie Thr . 
360 365 370 

aac tta agg gca gee caa ggc caa tet ate tat ttc tat gat ecg att 1267 
Asn Leu Arg Ala Ala Gin Gly Gin Ser lie Tyr Phe Tyr Asp Pro lie 
375 380 385 

gca tct aac acc aea gga get tea gac get ctg acc ate aac caa eeg 1315 

Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu Thr He Asr. Gin Pro 
390 395 400 405 

gat age aac teg eet tta gat tat tea gga aeg att gta tct tct ggg 1363 
Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr He Val Phe Ser Gly 
410 41S 420 
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Fig. 7 (con't) 

gaa aag etc tct gca gat gaa gcg aaa get get gat aac ttc aca tot 1411 
Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala Asp Asn Phe Thr Ser 
425 430 435 

ata tta aag caa cca ttg get eta gee tet gga ace tta gca etc aaa 1459 
lie Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly Thr Leu Ala Leu Lys 
440 445 450 

gga aat gtc gag tta gat gte aat ggt ttc aca cag act gaa ggc tct 1507 
Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr Gin Thr Glu Gly Ser 
455 460 465 

aca etc etc atg eaa cea gga aca aag etc aaa gca gat act gaa get 1555 
Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys Ala Asp Thr Glu Ala 
470 475 480 485 

ate agt ctt aec aaa ctt gtc gtt gat ett tct gee tta gag gga aat 1603 
lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser Ala Leu Glu Gly Asn 
490 495 500 

aag agt gtg tec att gaa aea gea gga gee aac aaa act ata act eta 1651 
Lys Ser Val Ser lie Glu Thr Ala Gly Ala Asn Lys Thr He Thr Leu 
505 510 515 

ace tet ect ctt gtt ttc caa gat agt age ggc aat ttt tat gaa age 1699 
Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly Asn Phe Tyr Glu Ser 
520 525 530 

cat acg ata aac caa gee ttc aeg cag cet ttg gtg gta ttc act get 1747 
His Thr He Asn Gin Ala Phe Thr Gin Pro Leu Val Val Phe Thr Ala 
535 540 545 

get act get get age gat att tat ate gat gcg ctt etc act tct cca 1795 
Ala Thr Ala Ala Ser Asp He Tyr He Asp Ala Leu Leu Thr Ser Pro 
550 555 560 565 

gta caa act eca gaa ect cat tae ggg tat cag gga cat tgg gaa gee 1843 
Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin Gly His Trp Glu Ala 
570 575 580 

act tgg gea gae aea tea act gca aaa tea gga act atg act tgg gta 1891 
Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly Thr Met Thr Trp Val 
585 590 595 

act aeg ggc tae aac cet aat ect gag egt aga get tec gta gtt ecc 1939 
Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Ala Ser Val Val Pro 
600 605 610 

gat tea tta tgg gea tee ttt act gae att ege act eta cag cag ate 1987 

Asp Ser Leu Trp Ala Ser Phe Thr Asp He Arc Thr Leu Gin Gin He 
615 ' 620 " £25 

atg aea tet caa gcg aat agt ate tat cag caa cga gga etc tag gca 2035 
Met Thr Ser Gin Ala Asn Ser He Tyr Gin Gin Arg Gly Leu Trp Ala 
630 635 640 645 
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Fig. 7(con't) 

tea gga act gcg aat ttc ttc cat aag gat aaa tea gga act aac caa 2083 
Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys Ser Gly Thr Asn Gin 
650 655 660 

gca ttc cga cat aaa age tac ggc tat att gtt gga gga agt get gaa 213i 
Ala Phe Arg His Lys Ser Tyr Gly Tyr He Val Gly Gly Ser Ala Glu 
665 670 675 

gat ttt tct gaa aat ate ttc agt gta get ttc tgc cag etc ttc ggt 2179 
Asp Phe Ser Glu Asn He Phe Ser Val Ala Phe Cys Gin Leu Phe Gly 
680 685 690 

aaa gat aaa gac etg ttt ata gtt gaa aat ace tct cat aac tat tta 2227 
Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr Ser His Asn Tyr Leu 
695 700 705 

gcg teg eta tac etg caa cat cga gca ttc eta gga gga ctt ccc atg 2275 
Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu Gly Gly Leu Pro Met 
710 715 720 725 

ecc tea ttt gga agt ate acc gac atg etg aaa gat att cct etc att 2323 
Pro Ser Phe Gly Ser He Thr Asp Met Leu Lys Asp He Pro Leu He 
730 735 740 

ttg aat gee cag eta age tac age tac act aaa aat gat atg gat act 2371 
Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys Asn Asp Met Asp Thr 
745 750 755 

cgc tat act tec tat cct gaa get caa ggc tct tgg acc aat aac tct 2419 
Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser Trp Thr Asn Asn Ser 
760 765 770 

ggg get eta gag etc gga gga tct etg get eta tat etc cct aaa gaa 2467 
Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu Tyr Leu Pro Lys Glu 
775 780 785 

gca ccg ttc ttc cag gga tat ttc cec ttc tta aag ttc cag gca gtc 2515 
Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu Lys Phe Gin Ala Val 
790 795 800 805 

tac age cgc caa caa aac ttt aaa gag agt ggc get gaa gee egt get 2563 
Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly Ala Glu Ala Arg Ala 
810 815 820 

ttt gat gat gga gac eta gtg aac tgc tct ate cct gtc ggc att egg 2 611 
Phe Asp Asp Gly Asp Leu Val Asn Cys Ser He Pro Val Gly He Arg 
825 830 835 

tta gaa aaa ate tec gaa gat gaa aaa aat aat ttc gag att tct eta 2659 
Leu Glu Lys lie Ser Glu Asp Olu hys Asn Asn Phe Glu He Ser Leu 
840 345 850 

gcc tac att ggt gat gtg tat cgt aaa aat cce cgt teg cat act tct 2707 
Ala Tyr He Gly Asp Val Tyr Arg Lys Asn Pro Arg Ser Arg Thr Ser 
S55 860 £65 
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Fig. 7 (con't) 

eta atg gtc agt gga gcc tct tgg act teg eta tgt aaa aac etc gca 2755 
Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu Cys Lys Asn Leu Ala 
870 875 880 885 

ega caa gcc ttc tta gca agt get gga age cat ctg act etc tec cet 2803 
Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His Leu Thr Leu Ser Pro 
890 895 900 

eat gta gaa etc tct ggg gaa get get tat gag ctt egt ggc tea gca 2851 
His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu Leu Arg Gly Ser Ala 
905 910 915 

cac ate tac aat gta gat tgt ggg eta aga tae tea ttc tagtteetac 2900 
His lie Tyr Asn Val Asp Cys Gly Le\i Arg Tyr Ser Phe 
920 925 930 

tttcctccct aaacttttag ggaggaatte ttataaaaae cetgtagatt cttaacttac 2960 

tagtctetce ttteetcttg ctttetttaa tttattgeag . 3000 
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Figure 8: Restriction enzyme analysis of CPN100639 



Tsp509I 
MboII Hpyl78III I 
Rsal XmnX | | 

I III 



Ddel 
CviJI 
Bfal 
Avrll I 
Bsa JI I 
Styll 
II 



Hpyl78III 
Bfal I 
Xbal| j 



rCTAGATT 



GTACGAAGXTCrrrCACGAAATTATAATACAAACCTAGGCTCTAAGT 

1 + + + + + + 60 

CyiTGCrrCAAGAAGTGCITTAATATTATGTTTGGATCCGAGAT^ 



TspEOSI 
Mselj 
Pad I 
Vspl I 
Tsp509I I I 
Af nil I I I Tthlllll 

Taqi Maell Mselj jj SfaNI | CviRI 

I I II II II I 

ATCGAAAACGTGTTAATTAATTGAACCCAAGCATCTTTCTATGAAAATACCCm^ 

61 + + + + + + 120 

TAGCTTTTGCACAATTAATTAACTTGGGTTCGTAGAAAGATACTT^ 



Pier 
Dpnl 
Sau3AI ( 
Hpyl78IIl| I 
MboII I 



Maelll 
Tsp45I 
PshAI 
Karl I 
Hinfl I 
TaqI I I 



CviRI 
BsrDI I SfaNI 



I 



ACTCCTGATCTCTTCGACTCTTGTCACTCCCATTCTATTGAGCATTGC^ . 

121 + + + + --- + ^+ 180 

TGAGGACTAGAGAAGCTCAGAACAGTGAGGGTAAGATAACTCGTAACGTTGAATGCCTCG 



CviJI 
FI1U4HI I 
Taul 
Acil 
Cac8I 
Acil 



Sfcl 



BsrBI 
Bed 
Faul 
Sthi32i I 
Alul I I 
CviJI 1. 1 



BsaXI 



AGATGCTTCTTTATCCCCTACAGATAGCTTTGATGGAGCGGGCGGCTCTACATTTACTCC 

181 + 4- + + + + 240 

TCTACGAAGAAATAGGGGATGTCTATCGAAACTACCTCGCCCGCCGAGATGTAAATGAGG 
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Fig. 8 (con't) 



Sfcl 

SfaNI I Hpyl78III SfaNI 

II. II 
AAAATCTACAGCAGATGCCAATGGAACGAACTATGTCTTATCAGGAAATGTCTATATAAA 

241 + 4- + + --+ + 300 

TTTTAGATGTCGTCTACGGTTACCTTGCrrTGATACAGAATAGTCCTTTACTVGATATAT^ 



301 



Fnu4HI 
Tsel I 

Msel Fnu4HI | | 
Bbvl I Cyi JI | j | 
Bbvl I I Tsel I I I Sthl32I 

III II 
CGATGCTGGGAAAGGCT^OVGCATTAACAGGCTGCrrocriTTACAGAAAC^ 

+ + + + + + 

GCTACGACCCTTTCCGTGTCGTAATTGTCCGACGACGAAATGTCITTGATGCCCACTAGA 



Hpyl88IX 
Dpnl 
SauSAI 
BscGI 
Bsp24I 
Cjel 
CjePI 



360 



361 



Faul NspV 

Taal TaqI 

Cjel Sthl32l| Bbvl I 

Bsri CjePI I Bsbl !| Acelll | j 

HphI BciVI I Bsp24l|| SfaNI | || Acil ||| CviRI 

III III I I II I . Ill I 

GACATTTACTGGAAAGGGATACTCATTTTCATTCAACACGGTAGATGCGGGTTCGAATGC 

+ + + + + + 

CTGTAAATGACCTTTCCCTATGAGTAAAAGTAAGTTGTGCCATCTACGCCCAAGCTTACG 



420 



CacBI 
Fnu4HI 
Taul 
Acil 
Fnu4HI 
Alul I 
AlwNlj 
BstAPI I 
CviJlj 
Mwolj 
Tsel I 
BsmI I I 
I II 

AGGAGCTGCGGCAAGCACAACTGCTGATAAAGCCCTAATCTTCACAGGATTTTCTAACCT 

421 + + + + + + 

TCCTCGACGCCGTTCGTGTTGACGACTATTTCGGGATTAGAAGTGTCCTAAAAGATTGGA 



CviJI 
MboIl| 
Mwol 
Tthllllll 



480 
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EC057I 
ScrFI 
AlwNI 
EcoRII 
Alul 
CviJI 

Fnu4HI I II I Acelll Rsal 
CviRI I I III Sfcl Hpyl78III Seal CviRI 

BsrDI Tselj I III Bbvl|TaaI | TatI |MseI Sfcl | 

I M I III Ml I III 11 

TTCCTTCATTGCAGCTCCTGGAACTACAGTTGCTTCAGGAAAAAGTACTTTAAGTTCTGC 

481 + + + + + + 540 

AAGGAAGTAACGTCGAGGACCTTGATGTCAACGAAGTCCTITTTCATGAAATTCAAGACG 



Msel 
Cjel 
CviJI I 
NlalVj I 
PstI I I I 



CjePI 
Hinfl I 
Tfil I 



BsmAI 
BsmBI 



CviJI 

Cjel I Maell 

II I I 

AGGAGCCTTAAATCTTACCGATAATGGAACGATTCTCTTTAGCCAAAACGTCTCCAATGA 

541 + + + + + + 600 

TCCTCGGAATTTAGAATGGCTATTACCTTGCTAAGAGAAATCGGTTTTGCAGAGGTTAC^ 



Dpnl 
Sau3AI I 
CjePI 
Ecil I 

Alul HphI I I I CjePI 

CviJI CjePI Acilj III Mbolll 

I I II M I II 

AGCTAATAACAATGGCGGAGCGATCACCACAAAAACTCTTTCTATTTCTGGGAATACCTC 

601 + + + + --- + + 660 

TCGATTATTGTTACCGCCTCGCTAGTGGTGTTTTTGAGAAAGATAAAGACCCTTATGGAG 



Bbvl 

Sfcl Fnu4HI 

Mnll Bfal Tsp509I Dpnl | Alul Taul 

Earl I Spell Hhal | Sau3AI | j CviJI Acil| 

I I II II III I II 

TTCTATAACCTTCACTAGTAATAGCGCAAAAAAATTAGGTGGAGCGATCTATAGCTCTGC 

661 + + + + + + 

AAGATATTGGAAGTGATCATTATCGCGTTTTTTTAATCCACCTCGCTAGATATCGAGACG 



720 



CviRI 
Pnu4HI I 
CviJI I I 
Tsel 



Hpyl78III 

I 



Bsri 
CviJI 
Haelll 
Eael 
Gdill 
Mspl I 
BsrFI I I 

III 



Cjel 



GGCTGCAAGTATTTCAGGAAACACCGGCCAGTTAGTCTTTATGAATAATAAAGGAGAAAC 

721 + + + - + + + 780 

CCGACGTTCATAAAGTCCTTTGTGGCCGGTCAATCAGAAATACTTATTATTTCCTCTTTG 
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Fig. 8 (con't) 



Alul 
CviJI 
Cac8I 
CviJI 



Banll Cjel | 

BsrX Bspl286I BseRX| | 
Bmrl I CviJI jCviJi ( j j 

II M I II I 



Mnll 
Acelll I 
Taqi I I 



Mboll 
Alull 
CviJI I 

II 



TGGGGGTGGGGCTCTGGGCTTTGAAGCCAG CTCCTCGATTACTCAAAATAGCTCCCTTTT 

781 + + - + + + + 840 

ACCCCCACCCCGAGACCCGAAACTTCGGTCGAGGAGCTAATGAGTTTTATCGAGGGAAAA 



Faul 
Cac8I 
PstI 
Sthl32I 
CviRI 
Fnu4HI 
Sfcl 
AlwNI 



Tsel 
BstAPI 
TspRI I 
CviRI 
BtsI I 
Bbvl I I 
Hpyl78III SfaNI j j |MwoI 
I Ml 



CviJI 
Haelll 
NlalV 
Sau96I I 
Acil I I 
Cjel I I I 



CTTCTCTGGAAACACTGCAACAGATGCTGCAGGCAAGGGCGGGGCCATTTATTGT^^ 

841 + + + + + + 900 

GAAGAGACCTnTGTGACGTTGTCTACGACGTCCGTTCCCGCCCCGGTAAATAACACTTOT 

Hin4I 

Cjel I EC057I 

Plel Cjel II Cjel I Bce83I 

BsmAI I Hinfl | j Hpyl78III Hpyl88IX | j MboII | 

II III I III II 

AACAGGAGAGACTCCTACTCTTACTATCTGTGGAAATAAAAGTCTGACCTTCGCCGAGAA 

901 + ■»- + + --- + + 960 

TTGTCCTCTCTGAGGATGAGAATGATAGAGACCTTTATTTTCAGACTGGAAGCGGCTCT^ 
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Fig. 8 (con't) 



Dpnl 
Bglll 
BstYI 
Sau3AI 
Hpyl78III 
Bfal 
Bbvl 
Xbal 
Nlalll 



BsaJI 
Bspl286I 
BstDSI 
NCOI 
Styl 
Bmgl 
BseSI 
BstAPI I 
Mwol 



Earl Ecil 
Maelll Smll Acil | 
I I II 

CTCTTCAGTAACTCyUlGGCGGAGCAATCTGTGCCCATGGTCrAGATCTTTCCGCTC 
961 + + + + + 1020 

GAGAAGTCATTGAGTTCCGCCTCGTTAGACACGGGTACCAGATCTAGAAAGGCGACGACC 



CviJI 
Haelll 
Sau96I 
CacBI 
Pnu4HI 
MspAlI I 
Tsel I 
Acil I I 
i II 



Faul 
Sthl32l| Acil 
SfaNI I I Bbvl I 



CacBI 
PstI 
CviRI 
Fnu4HI 
Sfcl 
Alul| 
CviJI I 
MspAlI I 
Pvullj 
Tsel 



Fnu4HI 
Taul 
Acil|HaeII 
Mwol I Hhal I 



CCCTACCCTATTTTCAAATAATAGATGCGGGAACACAGCTGCAGGCAAGGGCGGCGCTAT 

1021 + + + + + + 1080 

GGGATGGGATAAAAGTTTATTATCTACGCCCTTGTGTCGACGTCCGTTCCCGCCGCGATA 



Dpnl 
BstYI 
Sau3AI 
Hpyl78IIl| 
HaelVj I 
Hiii4l| I 
Hinf I I I I 
I III 



Plel 
Muni 1 

T3P509I I Hin4lji j Alwl 

CviRI I 1 Hinf I | | | |MseI 
II I 

TGCAATTGCCGACTCTGGATCTTTAAGTCTCTCTGCAAATCAAGGAGACATCACGTTCCT 

1081 + + + + + + 1140 

ACGTTAACGGCTGAGACCTAGAAATTCAGAGAGACGTTTAGTTCCTCTGTAGTGCAAGGA 



BsmAI 
CviRI I 
BsmAI I jcjel 



BsaJI 
Maell Styl 

I I 
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Fig. 8 (con't) 



Bsbl 



Cjel 



Mnll 
HhaX 
Thai I 
Mnll I I 
Acilll I 
III I 



Taqi 



Sau3AI 
ScrFI 
BsaJI I 
EcoRIlj I 
BsmI I I j 
I Ml 



TGGCAACACTCTAACCTCAACCTCCGCGCCAACATCGACACGGAATGCTATCTACCTGGG 

1141 + + + + + + 1200 

ACCGTTGTGAGATTGGAGTTGGAGGCGCGGTTGTAGCTGTGCCTTACGATAGATGGACCC 



Bbvl 
CviJI 
Hael 
Haelll 
BsaJI 
Styl 
CviJI 



Dpnl 



Tsp509I 
Alwl I 
I I 



Fnu4HI 
Tsel 
CjePI 
Msel I 
Afllll I 
Stnllj I 

II I 



Dpnl 
CjePI I 
Sau3Al| I 
Alwl I I I 

I III 



ATCGTCAGCAAAAATTACGAACTTAAGGGCAGCCCAAGGCCAATCTATCTATTTCTAT^ 

1201 ---+ + + + + + 1260 

TAGCAGTCGTTTTTAATGCTTGAATTCCCGTCGGGTTCCGGTTAGATAGATAAAGATACT 



HpylSBIX 
Mae II 
HpylSSIX I 

EC057I Alul I I j Mspl 

HpylSBIX CviRI. ISfaNI CviJI | j | Bed BsaWI | Hin4I 

I III III 

TCCGATTGCATCTAACACCACAGGAGCTTCAGACGTTCTGACCATCAACCAACCGGATAG 

1261 + + + --- + + + 1320 

AGGCTAACGTAGATTGTGGTGTCCTCGAAGTCTGCAAGACTGGTAGTTGGTTGGCCTATC 



Bbvl 
PstI 

Alul CviRI I 

Hpyl7eiII CviJI Sfcl | j 

I I M I 

CAACTCGCCTTTAGATTATTCAGGAACGATTGTATTTTCTGGGGAAAAGCTCTCTGCAGA 

1321 + + + + + + 1380 

GTTGAGCGGAAATCTAATAAGTCCTTGCTAACATAAAAGACCCCTTTTCGAGAGACGTCT 

Fnu4Hl 
Alul I 

CviJI I CviJI 
Mwol I Bf al I 

Mwol Tsel 1 Msel CviJI | j 

I II III! 

TGAAGCGAAAGCTGCTGATAACTTCACATCTATATTAAAGCAACCATTGGCTCTAGCCTC 

1381 + + + --- + + + 1440 

ACTTCGCTTTCGACGACTATTGAAGTGTAGATATAATTTCGTTGGTAACCGAGATCGGAG 
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Fig. 8 (con't) 

BpulOI 
Ddel 
Mnll 
NlalV I 

Hpyl78III I I TaqI 
III I 

TGGAACCTTAGCACrrCAAAGGAAATGTCGAGTTAGATGTCAATGGTTTCACACAGAC^ 

1441 + + + + + + 1500 

ACCTTGGAATCGTGAGTTTCCTTTACAGCTCAATCTACAGTTACCAAAGTGTGTCT6ACT 

ScrFI 
Mnll 
EcoRII 
CviRI I 

CviJI Nlalll I I I Alul Alul 

BseRl| Eco57l| III CviJI AlwNI CviJI 

III 
AGGCTCTACACTCCTCATGCAACCAGGAACAAAGCTCAAAGCAGATACTGAAGCTATCAG 

1501 + + + + + + 1560 

TCCGAGATGTGAGGAGTACGTTGGTCCTTGTTTCGAGTTTCGTCTATGACTTCGATAGTC 

Dpnl Ddel 
EC057I Sau3AI I Mnll | 

I 11 II 

TCTTACCAAACTTGTCGTTGATCTTTCTGCCTTAGAGGGAAATAAGAGTGTGTCCAOT 

1561 + + + + + + 1620 

ACAATGGTTTGAACAGCAACTAGAAAGACGGAATCTCCCTTTATTCTCACACAGGTAACT 

CviJI 

KlaIV( BseRI Mnll Mnll Acil 

II I I I I 

AACAGCAGGAGCCAACAAAACTATAACTCTAACCrCTCCTCTTGTTTTCCAAGATAGTAG 

1621 + + + + + + 1680 

TTGTCGTCCTCGGTTGTTTTGATATTGAGATTGGAGAGGAGAACAAAAOT 

Tsp509I CviJI 
Fnu4HI I Fnu4HI I Bbvl 

Taul I CviJI CviJI Tsel | j Bbvl | 

II I I II I I I 

CGGCAA rrnT ATGAAAGCCATACGATAAACCAAGCCTTCACGCAGCCTTTGGTGGTATT 

1681 :-- + + + + + + 1740 

GCCGTTAAAAATACTTTCGGTATGCTATT1XK3TTCGGAAGTGCGTCGGAAACCACCATAA 

CacBI 
Bfal 
TspRl Nhel 
Pnu4Hl| Fnu4HI I 
Tsel I I Mwol I III Bpml Rsal 

BtsI III Tselj III Clal I TatI | 

Bbvl I I I |MwoI II III SfaNI TaqI |HhaI Bpml BsrI | | 

I I III I II III I III I III 

CACTGCTGCTACTGCTGCTAGCGATATTTATATCGATGCGCTTCTCACTTCTCCAGTACA 

1741 + + + + + + 1800 

GTGACGACGATGACGACGATCGCTATAAATATAGCTACGCGAAGAGTGAAGAGGTCATGT 



SUBSTITUTE SHEET (RULE 26) 



wo 00/32794 



PCT/CA99/01147 



63/77 



Fig. 8 (con't) 



Mnll 

Sthl32I BscGl| BsmFI Bgll 

Hpyivaill I BsII II CjePI CviJil Mwol 

I I I II I II I 

AACTCCAGAACCTCATTACGGGTATCAGGGACATTGGGAAGCCACinXSGGCAGAC^ 

1801 + + + + + + 1860 

TTGAGGTCTTGGAGTAATGCCCATAGTCCCTGTAACCCTTCGGTGAACCCGTCTGTGTAG 



BpulOI 

Hpyl78III Ddel 
CviRI I Maelll CviJI Hpyl78III 

CjePI I I Sthl32l| BscGI iBseMII | 

II I . II III I 

AACTGCT^AAATCAGGAACTATGACTTGGGTAACTACGGGCTACAACCCTAATCCTGAGCG 

1861 - + + + + + + 1920 

TTGACGTTTTAGTCCTTGATACrGAACCCATTGATGCCCGATGTTGGGATTAGGACTCGC 

Sthl32I 
Hinfl 
Tfil 
Foklj 

Alul Hpyl78IIl| I I CjePI BceSBI 

CviJI CjePI III I SfaNI | Sfcl | 

I I III I II II 

TAGAGCTTCCGTAGTTCCCGATTCATTATGGGCATCCTTTACTGACATTCGCACTCTACA 

1921 + + + + + + 1980 

ATCTCGAAGGCATCAAGGGCTAAGTAATACCCGTAGGAAATGACTGTAAGCGTGAGATGT 



Plel 

Smll Mnll I Hinfl Hpyl78III 



NlaXII 
Hpyl78III 
Real I 
Dpnlj I 
Sau3AI I I I 
I II I 

GCAGATCATGACATCTCAAGCGAATAGTATCTATCAGCAACGAGGACTCTGGGCATCAGG 

1981 + + --+ + + + 2040 

. CGTCTAGTACTGTAGAGTTCGCTTATCATAGATAGTCGTTGCTCCTGAGACCCGTAGTCC 

Apol Alul 

Tsp509I CviJI 

MboII I Tthlllll 1 

AlwNI II Hpyl88IX j | 

SfaNI I jxmnl Hpyl78III Bsml | Mmel j | 

Ml! J I I II I 

AACTGCGAATTTCTTCCATAAGGATAAATCAGGAACTAACCAAGCATTCCGACATAAAAG 

2041 --+ + + + + ----+ 2100 

TTGACGCTTAAAGAAGGTATTCCTATTTAGTCCTTGATTGGTTCGTAAGGCTGTATT^ 

MboII 

MboII I Alul 

Mmel Hpyl88IX|l CviJI 

CviJI Mnll I Beef I Eco57I ||| Eco57I TspRI | 

I II I ! Ill I II 

CTACGGCTATATTGTTGGAGGAAGTGCTGAAGATTTTrCTGAAAATATCTTCAGTGTAGC 

2101 + + + + + + 2160 

GATGCCGATATAACAACCTCCTTCACGACTTCTAAAAAGACTTTTATAGAAGTCACATCG 



SUBSmrUTE sheet (rule 25) 



wo 00/32794 



PCT/CA99/01147 



64/77 



Fig. 8 (con*t) 



Alul 

CviJI Acelll 

Cac8I t Earl | 

Mboll I I Sapl I Hgal 

I I i I I I 

TTTCTGCCy^GCrCTTCGGTAAAGATAAAGACCTGTTTATAGTTGAAAAT^ 

2161 + + + + + + 2220 

AAAGACGGTCGAGAAGCCATTTCTATTTCTGGACAAATATCAACTTTTATGGAGAGTATT 

Bfal 
Avrlll 
BsaJI I 
Stylt 

CviRI Taqi Mnll | \ 
Mnll Mwol |BspMl|BsmI | || Nlalll 

I I I II I I II I 
CTATTTAGCGTCGCTATACCTGCAACATCGAGCATTCCTAGGAGGACTTCCCATGCCCTC 

2221 + + + + + + 2280 

GATAAATCGCAGCGATATGGACGTTGTAGCTCGTAAGGATCCTCCTGAAGGGTACGGGAG 

Alul 
CviJI 
Bpull02I 
Ddel 

Mnll Alul I 

HphI I Nlalll CviJI I 

BslI I I MslI Nspl Mnll BsmI | j 

III II I I II 

ATTTGGAAGTATCACCGACATGCTGAAAGATATTCCTCTCATTTTGAATGCCCAGCTAAG 

2281 + + + + + + 2340 

TAAACCTTCATAGTGGCTGTACGACTTTCTATAAGGAGAGTAAAACTTACGGGTCGATTC 

Alul snai 
CviJI HpylVBIII Alul I 

Sfcl 1 BciVI Bce83I | CviJI | CviJI 

II I I I II I 
CTACAGCTACACTAAAAATGATATGGATACTCGCTATACTTCCTATCCTGAAGCTCAAGG 

2341 + + + + + + 2400 

GATGTCGATGTGATTTTTACTATACCTATGAGCGATATGAAGGATAGGACTTCGAGTTCC 
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Dpnl 
BstYI 
Sau3AI 
Hpyl88IX 
Banll 



EC057I 
Avail 
Sau96I 



BsiHKAI 
Bspl286I 
Sad 
Alul 
CviJI 
Mnll 
CjePI 
Hpyl78III 
B£al 
Banll I 
B3pl286I| 
Xbalj 
CviJI I I 
I II 



CviJI 
Alwl| 

II 



CjePI 
I 



CTCTTGGACCAATAACTCTGGGGCTCTAGAGCTCGGAGGATCTCTGGCTCTATATCTCCC 

2401 + + + + --- + + 2460 

GAGAACCTGGTTATTGAGACCCCGAGATCTCGAGCCTCCTAGAGACCGAGATATAGAGGG 



ScrFI 
BsaJI I 
EcoRIlll 
MboII Taal | | | 



ScrFI 
Msel EcoRII I 



Sfcl 
AccllCviJI 



TAAAGAAGCACCGTTCTTCCAGGGATATTTCCCCITCTTAAAGTTCCAGGCAGTCTACAG 

2461 +---- + + + + + 2520 

ATTTCTTCGTGGCAAGAAGGTCCCTATAAAGGGGAAGAATTTCAAGGTCCGTCAGATC 

Bed 

Acil Sthl32I Eco57l| 

FI1U4HI Dral Haell BscGI | Bsal | | Bfal 

Taul Msel I Hhal | CviJI | | BsmAI | j BsaXI | 

I II II M I I II II 

CCGCCAACAAAACTTTAAAGAGAGTGGCGCTGAAGCCCGTGCTTTTGATGATGGAGACCT 

2521 + + + + + + 2580 

GGCGGTTGTTTTGAAATTTCTCTCACCGCGACTTCGGGCACGAAAACTAC^ 

Bsmi Hpyl88IX MboII 

I i I 

AGTGAACTGCTCTATCCCTGTCGGCATTCGGTTAGAAAAAATCTCCGAAGATGAAAAAAA 

2581 + + + + + + 2640 

TCACTTGACGAGATAGGGACAGCCGTAAGCCAATCrm"lTAGAGGCTTCTACTTTT^ 



Hpyl78III 
T3P509I TaqI | 



CviJI 
Bfal I 



BsaBI HphI 



Sthl32I 
BscGI Thai 



TAATTTCGAGATTTCTCTAGCCTACATTGGTGATGTGTATCGTAAAAATCCCCGTTCGCG 

2641 + + --- + + + + 2700 

ATTAAAGCTCTAAAGAGATCGGATGTAACCACTACACATAGCATTTTTAGGGGCAAGCGC 
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Fig. 8 (con't) 



CviJI 

NlalVl Mnll 
Rsal TspRI I Mnll Mwol 

I II I I 

TACTTCrCTAATGGTCAGTGGAGCCrCTTGGACrrTCGCTATGTAAAAACCrrCGC^ 

2701 + + + -- + + + 2760 

ATGAAGAGATTACCAGTCACCTCGGAGAACCTGAAGCGATACATTTTTGGAGCGTGCTGT 



Hinf I 
Hpyl88IX 
Bed I 

Mwol Plel I I 

CviJI Ddel I CviJI | ( ( 

I II Mi l 



Bbvl 
Mnll I 
Nlalll I I 

II 



AGCCTTCTTAGCAAGTGCTGGAAGCCATCTGACTCTCTCCCCTCATGTAGAACTCTCTGG 

2761 + + : + + + + 2820 

TCGGAAGAATCGTTCACGACCTTCGGTAGACTGAGAGAGGGGAGTACATCTTGAGAGACC 

Pnu4HI 

Alul| Bpull02I 

CviJI I Alul Ddel RleAI Ddel 

Tselj CviJI CviJI I BseMIll Cvi JI ( 

II I II II II 

GGAAGCTGCTTATGAGCTTCGTGGCTCAGCACACATCTACAATGTAGATTGTGGGCTAAG 

2821 + + + + + + 2880 

CCTTCGACGAATACTCGAAGCACCGAGTCGTGTGTAGATGTTACATCTAACACCCGATTC 

^Ol 

Mnll ECORI 
Bfal Mnll I Tsp509I 

I II I 

ATACTCATTCTAGTTCCTACTTTCCTCCCTAAACTTTTAGGGAGGAATTCTTATAAAAAC 

2881 + + + + + + 2940 

TATGAGTAAGATCAAGGATGAAAGGAGGGATTTGAAAATCCCTCCTTAAGAATATTTT^ 

Tsp509I 

Hinfl Bfal Msel | 

Sfcl Tfil Msel Spell BsmAI Mnll || CviRI 

I I I II I I II I 

CCTGTAGATTCTTAACTTACTAGTCTCTCCTTTCCTCTTGCTTTCTTTAAT^ 

2941 + + + + + + 3000 

GGACATCTAAGAATTGAATGATCAGAGAGGAAAGGAGAACGAT^GAAATTAAATAACGTC 
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Figure 9: Nucleotide and amino acid sequence of CPNl 00708 

ataaagtttt ttatatgaac aaaactttga cttcattggc agacacttcg accttaacag 60 

accaattttg ttgtcatccc tataaaaatc aggaattttc atg etc tec tea eta 115 

Met Leu Ser Ser Leu 
1 5 

ate cgt gat tea ttt ccc ctt ctt att tta ctt ccc aca ttc eta gcg 163 
He Arg Asp Ser Phe Pro Leu Leu He Leu Leu Pro Thr Phe Leu Ala 
10 15 20 

gea tta gga gee tec gta get ggc ggc gtt atg gga acc tat ate gtt 211 
Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met Gly Thr Tyr He Val 
25 30 35 

gta aaa cgt att gtt tea att agt gga agt ata tct cat gea att eta 259 
Val Lys Arg He Val Ser He Ser Gly Ser He Ser His Ala He Leu 
40 45 50 

gga gga att ggc etc acc eta tgg ata caa tat aag ctt cat etc tct 307 
Gly Gly He Gly Leu Thr Leu Trp He Gin Tyr Lys Leu His Leu Ser 
55 60 65 

ttt ttc cet atg tat gga get att gta gga get att ttt eta get ctt 355 
Phe Phe Pro Met Tyr Gly Ala He Val Gly Ala He Phe Leu Ala Leu 
70 75 80 85 

tgc ate ggc aag ate cae ctg aaa tac caa gaa agg gaa gac tet ttg 403 
Cys He Gly Lys He His Leu Lys Tyr Gin Glu Arg Glu Asp Ser Leu 
90 95 100 

att gcg atg att tgg tct gtg ggc atg gea att gga att ata ttc att 451 
He Ala Met He Trp Ser Val Gly Met Ala He Gly He He Phe He 
105 110 115 

tec agg ctt ccc acc ttt aat gga gag etc ate aat ttt eta ttt ggg 499 
Ser Arg Leu Pro Thr Phe Asn Gly Glu Leu He Asn Phe Leu Phe Gly 
120 125 130 

aac att etc tgg gtc acc cct tea gac etc tat age tta gga ate ttt 54 7 
Asn He Leu Trp Val Thr Pro Ser Asp Leu Tyr Ser Leu Gly He Phe 
135 140 145 

gat ctt ctt gtt tta gga att gtg gtc ctt tgc cac acc egg ttc ctt 595 
Asp Leu Leu Val Leu Gly He Val Val Leu Cys His Thr Arg Phe Leu 
150 155 160 165 

get ctt tgc tzz gar gag agg tac acg get tra aac cat tgt tct gta c43 
Ala Leu Cys Phe Asp Glu Arg Tyr Thr Ala Leu Asn His Cys Ser Val 
170 175 180 

cag ctg tgg tat ttc eta ctt ctt gtt ctg aca gea ate acg att gtg 691 
Gin Leu Trp T%'r Phe Leu Leu Leu Val Leu Thr Ala He Thr He Val 
185 190 195 
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Fig. 9 (con't) 

atg ttg att tat gtg atg gga acg att ctg atg ctt age atg etc gtc 739 
Met Leu lie Tyr Val Met Gly Thr lie Leu Met Leu Ser Met Leu Val 
200 205 210 

tta cct gtt get ata gcg tgt aga ttt teg tae aag atg aea ega att 787 
Leu Pro Val Ala lie Ala Cys Arg Phe Ser Tyr Lys Met Thr Arg He 
215 220 225 

atg ttc ate teg gte etc ttg aat ate tta tgt tct ttt tct gga att 835 
Met Phe He Ser Val Leu Leu Asn He Leu Cys Ser Phe Ser Gly He 
230 235 240 245 

tgc ate gee tac tgt tta gat ttc eea gta ggt cct acg ata tea ttg 8B3 
Cys He Ala Tyr Cys Leu Asp Phe Pro Val Gly Pro Thr He Ser Leu 
250 255 260 

ctg atg ggg tta ggt tat aea gcg agt ett tgt gtg aag aag egg tac 931 
Leu Met Gly Leu Gly Tyr Thr Ala Ser Leu Cys Val Lys Lys Arg Tyr 
265 270 275 

aat ccg teg aeg cct tct cct gta agt eet gaa ate aat aea aat gta 979 
Asn Pro Ser Thr Pro Ser Pro Val Ser Pro Glu He Asn Thr Asn Val 
280 285 290 

tagetaggga agcgcttttg gaagetttgg aggcattctt cctgttegte aggaagaaga 1039 

tcatcaattt tatttaaagc taecagcata tctttetttt caaaatctgg etgatgagag 1099 

t 1100 



SUBSTITUTE SHEET (RULE 26) 



wo 00/32794 



PCT/CA99/0I147 



69/77 



Figure 10: Restriction enzyme analysis of CPNl 00708 

Bcgl TaqI Msel 

I II 

ATAAAGlTrT'ri'ATATGAACAAAACTTTGACTTCATTGGCAGACACTTCGACCTTAACAG 
1 + ^ + + + + go 

TATTTCAAAAAATATACTTGTTTTGAAACTGAAGTAACCGTCTGTGAAGCTGGAATTGTC 

BseRI 

Tsp509I Apol I 

Bcgl I Tsp509I I 

Foklll Hpyl78III I jNlalll Mnll 

III Mil I 

ACCAATTTTGTTGTCATCCCTATAAAAATCAGGAATTTTCATGCTCTCCTCACT 
61 + _ + + + + 120 

TGGTTAAAACAACAGTAGGGATATTTTTAGTCCTTAAAAGTACGAGAGGAGTGATTAGGC 

RleAI 
Fnu4HI I BsaXI 
Taul I CviJI I 

Hinfl Acil| I NlalVl j 

TfiX B£aX II |MwoI II jMwoI 

I I II I I II I I 

TGATTCATTTCCCCITCTTATTTTACITCCCACATTCCT 

121 + + + + + + 180 

ACTAAGTAAAGGGGAAGAATAAAATGAAGGGTGTAAGGATCGCCGTAATCCTCGGAGGCA 

Pnu4HI 
Taul 
Acil 
CacBI 
Mnll 
Alul I 
CviJI I 
I I 



NlalV Maell Tsp509I 

I II 
AGCTGGCGGCGTTATGGGAACCTATATCGTTGTAAAACGTATTGTTTCAATTAGTGGAAG 

181 + + + + + + 240 

TCGACCGCCGCAATACCCrnrGGATATAGCAACATTTrGCATAACAA^ 

Bfal CviJI 
MnlX I Hael 
Tsp509I I I Haelll Alul 

CviRI I I I HphI I CviJI 
Nlalllj I lT3p509X| I BciVI Mnll Hindlll j 

III I Mill II 

TATATCTCATGCAATTCTAGGAGGAATTGGCCTCACCCTATGGATACAATATAAGCTTCA 

241 + + + + + + 300 

ATATAGAGTACGTTAAGATCCTCCTTAACCGGAGTGGGATACCTATGTTATATTCGAAGT 

BsaXI Alul 
Alul I Alul CviJI 

BslI CviJI I CviJI Bfal | CviRI 

III I II I 
TCTCTCrrnTl'CCCTATGTATCkSAGCTATTGTAGGAGCTATTTTTCTAGCTCTTTGCAT 
301 + + + + :__ + + 360 

AGAGAGAAAAAAGGGATACATACCTCGATAACATCCTCGATAAAAAGATCGAGAAACGTA 
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Dpnl 
BstYI 
Sau3AI 
SfaNI 
Alwl I 



Plel Bbsl 
Bsll| Hinfl iMboII 
Mil I 



RleAI 



CGGGAAGATCCACCTGAAATACCAAGAAAGGGAAGACTCTTTGATTGCGATGATTTGGTC 

361 + + + + + + 

GCCGTTCTAGGTGGAOTTTATGGTTCTTTCCCTTCTGAGAAACTAACGCT 



420 



Banll 
BsiHKAI 
B8P1286I 

Tsp509I CviJI Sad 

Muni I Cjel I Alul I 

Tsp509I 1 ScrFI j CviJI | 

Nlalll I I EcoRII I I Msel Hiii4I j 

III III i II 

TGTGGGCATGGCAATTGGAATTATATTCATTTCCAGGCTTCCCACCIT^ 

421 + + + + 480 

ACACCCGTACCGTTAACCTTAATATAAGTAAAGGTCCGAAGGGTGGAAATTACCTCTCGA 



Hinfl 
Mnll 
BpulOI t 
Ddelj 
Alul I I 
Cjel CviJI i I 
•HpylBSIX ISfcl I I |TfiI 

I I I II I I I I III I 

CATCAATTTTCTATTTGGGAACATTCTCTGGGTCACCCCTTCAGACCTC^ 

481 + + + + + + 

GTAGTTAAAAGATAAACCCTTGTAAGAGACCCAGTGGGGAAGTCTGGAGATATCGAATCC 



Cjel 
Cjel I 
Tsp509I I I 



BstEII 
Maelll 
HphI SimX 
Eco57l|Tsp45I 



540 



Dpnl 
SauBAX I 
MboII I I 



I I 



Avail 
Cjel 

Tsp509I Sau96I 



Sthl32I 
Nlaiv 
Mspl I 
Neil I 
ScrFI 



AATCTTTGATCTTCTTGTTTTAGGAATTGTGGTCCTTTGCCACACCCGGOT 

541 + + + + + 600 

TTAGAAACTAGAAGAACAAAATCCTTAACACCAGGAAACGGTGTGGGCCAAGGAACGAGA 



Alul 
CviJI 
MspAlI 
PvuII 
Rsal 
BsrGI I 
TatI I 
Bcefl I 



Mnll 
Cjel I 



Dral 
Msel I 

Rsal CviJI I I 



I II 



TTGCTTTGATGAGAGGTACACGGCTTTAAACCATTGTTCTGTACAGCTGTGGTATTTCCT 

601 + + + + + + 660 

AACGAAACTACTCTCCATGTGCCGAAATTTGGTAACAAGACATGTCGACACCATAAAGGA 
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Fig. lO(con't) 



Hpyl88IX 



Hpyl88IX 



MslI 
Hpyl78III I 
I I 



Hinfl 
SfaNI j 
Bcgl Bed |TfiI 

I III 



I 



AL UT Cl " rGTTCTGACAGCAATCACGATTGTGATGTTGATTTATGTGATGGGAACGATTCT 

661 + + + + + + 720 

TGAAGAAO^GACTGTCGTTAGTGCrrAACACTACAACTAAATACACTACCCTTGCTAAGA 

Nlalll 
Nspl 
SphI 

Bpull02I Cac8I I 

Ddel Mwol| |BcgI Sfcl Rsal 

I III I I I 

GATGCrrAGCATGCrCGTCTTACCTGTTGCTATAGCGTGTAGATTTT 

721 + + + + + + 780 

CTACGAATCGTAOSAGCAGAATGGACAACGATATCGCACATCTAAAAGCATGTTCTA 



Taqll 
Tsp509I I 



Hpyl78III 
Avail I 
Sau96I 1 Mnll 



Apol 
Tsp509I 
Hpyl78III 1 CviRI 



ACGAATTATGTTCATCTCGGTCCrCTTGAATATCrTATGTTCrr 

781 + + + + + + 840 

TGinTAATACAAGTAGAGCCyVGGAGAACTTATAGAATACAAGAAAAAGACCOT 



Taal 
SfaNI I 



BsrDI 
EcoRV 
PflllOSI 
Avail 
ECOO109I 
PspSII 
5au96I 
Sse8647I 
Bcgl I 
Bmrl| I 
Bsrill I 

III I 



Bed 



I 



Bcgl 



CX3CCrrACTGTTTAGATTTCCCAGTAGGTCCTACGATATCATTGCTGATGGGGTTAGG^ 

841 + + + - + + + 900 

GCGGATGAOVAATCTAAAGGGTCATCCAGGATGCTATAGTAACGACTACCCCAATCCAAT 



BsaXI 
Hin4I 
BsaHI 
Hindi 
Mboll Acd I 
Rsal I TaqI | 

Hinfl Plel Acil | | Sall| j ( | |HgaI Hpyl78III 

I I I II III I II I I 

TACAGCX5AGTCTTTGTGTGAAGAAGCGGTACAATCCGTCGACGCCTTCTCCTGTAAGTCC 

901 + + + + + + 

ATGTCGCrCAGAAACACACTTCTTCGCCATGTTAGGCAGCTGCGGAAGAGGACATTCAGG 



960 
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Fig. 10(con't) 

Haell Alul 
Bfal Hhal| CviJI Bsml 

Alul] Eco47IIl|| Mnll| Mwol | 
CviJI I Mwol I I 1 Hindlll | | MboII | j 

II I III ill 11 I 

TGAAATCAATACAAATGTATAGCTAGGGAAGCGCITITGGAAGCTTT 

961 + + + + + + 1020 

ACTTTAGTTATGTTTACATATCGATCCCTTCGCGAAAACCTTCGAAACCTCCGTAAGAAG 

MboII 



MboII 
Tsp509I 
Dpnl I 
Hpyl78III Sau3AI | | 
I II 



Alul 
CviJI 
Dral I 
Msel I I 
M I 

CTGTTCGTCAGGAAGAAGATCATCAATTTTATTTAAAGCTACCAGCATATCTTTCTT^ 

1021 + + -+ + r---*. + 1080 

GACAAGCAGTCCTTCTTCTAGTAGTTAAAATAAATTTC6ATGGTCGTATAGAAAGAAAAG 

CviJI 
1 

AAAATCTGGCTGATGAGAGT 

1081 + + 1100 

TTTTAGACCGACTACTCTCA 
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Figure 1 1 : Identification of T- and B- cell epitopes from open reading frame ORF 
cpn1 00634 (SEQ ID NO: 11) 

1 MKTSIPWVLV SSVLAFSCHL QSLANEELLS PDDSFNGNID SGTFTPKTSA 

51 TTYSLTGDVF FYEPGKGTPL SDSCFKQTTD NLTFLGNGHS LTFGFIDAGT 

101 HAGAAASTTA NKNLTFSGFS LLSFDSSPST TVTTGQGTLS SAGGVNLENI 

151 RKLWAGNFS TADGGAIKGA SFLLTGTSGD ALFSNNSSST KGGAIATTAG 

201 ARIANNTGYV RFLSNIASTS GGAIDDEGTS ILSNNKFLYF EGNAAKTTGG 

251 AICNTKASGS PELIISNNKT LIFASNVAET SGGAIHAKKL ALSSGGFTEF 

301 LRNNVSSATP KGGAISIDAS GELSLSAETG NITFVRNTLT TTGSTDTPKR 

351 NAINIGSNGK FTELRAAKNH TIFFYDPITS EGTSSDVLKI NNGSAGALNP 

401 YQGTILFSGE TLTADELKVA DNLKSSFTQP VSLSGGKLLL QKGVTLESTS 

451 FSQEAGSLLG MDSGTTLSTT AGSITITNLG INVDSLGLKQ PVSLTAKGAS 

501 NKVIVSGKLN LIDIEGNIYE SHMFSHDQLF SLLKITVDAD VDTNVDISSL 

551 IPVPAEDPNS EYGFQGQWNV NWTTDTATNT KEATATWTKT GFVPSPERKS 

601 ALVCNTLWGV FTDIRSLQQL VEIGATGMEH KQGFWVSSMT NFLHKTGDEN 

651 RKGFRHTSGG YVIGGSAHTP KDDLFTFAFC HLFARDKDCF lAHNNSRTYG 

701 GTLFFKHSHT LQPQNYLRLG RAKFSESAIE KFPREIPLAL DVQVSFSHSD 

751 NRMETHYTSL PESEGSWSNE CIAGGIGLDL PFVLSNPHPL FKTFIPQMKV 

801 EMVYVSQNSF FESSSDGRGF SIGRLLNLSI PVGAKFVQGD IGDSYTYDLS 

851 GFFVSDVYRN NPQSTATLVM SPDSWKIRGG NLSRQAFLLR GSNNYVYNSN 
901 CELFGHYAME LRGSSRNYNV DVGTKLRF 



Possible T cell epitope: 
824 RliLNLSIPV 
Possible B cell epitope : 
644 HKTGDENRKGFRHTS 
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Figure 12: Identification of T- and B- cell epitopes from open reading frame ORF: 
cpn1 00635 (SEQ ID.NO: 12) 

1 MKSQFSWLVL SSTLACFTSC STVFAATAEN IGPSDSFDGS TNTGTYTPKN 
51 TTTGIDYTLT GDITLQNLGD SAALTKGCFS DTTESLSFAG KGYSLSFLNI 
101 KSSAEGAALS VTTDKNLSLT GFSSLTFLTVA PSSVITTPSG KGAVKCGGDL 
151 TFDNNGTILF KQDYCEENGG AISTKNLSLK NSTGSISFEG NKSSATGKKG 
201 GAICATGTVD ITNNTAPTLF SNNIAEAAGG AINSTGNCTI TGNTSLVFSE 
251 NSVTATAGNG GALSGDADVT IS(2gQSVTFS GNQAVANGGA lYAKKLTLAS 
301 GGGGGNPFSN NIVQGTTAGN GGAISILAAG ECSLFSEAGD HYLNGNAIVA 
351 TTPQTTKRNS IDIGSTGKDH ELRAISGHSI FFYDPITANT AADSTDTLNL 
401 NKADAGNSTD YSGSIVFSGE KLSEDEAKVA DNLTSTLKQP VTLTAGNLVL 
451 KRGVTLDTKG FTQTAGSSVI MDAGTTLKAS TEEVTLTGLS IPVDSLGEGK 
501 KWIAASAAS KNVALSGPIL LLDNQGNAYE NHDLGKTQDF SFVQLSALGT 
551 ATTTDVPAVP TVATPTHYGY QGTWGMTWVD DTASTPKTKT ATLAWTNTGY 
601 LPNPERQGPL VPNSLWGSFS DIQAIQGVIE RSALTLCSDR GFWAAGVANF 
651 LDKDKKGEKR KYRHKSGGYA IGGAAQTCSE NLISFAFCQL FGSDKDFLVA 
701 KNHTDTYAGA FYIQHITECS GFIGCLLDKL PGSWSHKPLV LEGQLAYSHV 
751 SNDLKTKYTA YPEVKGSWGN NAFNMMLGAS SHSYPEYIiHC FDTYAPYIKL 
801 NLTYIRQDSF SEKGTEGRSF DDSNLFNLSL PIGVKFEKFS DCNDFSYDLT 
851 LSYVPDLIRN DPKCTTALVI SGASWETYAN NLARQALQVR AGSHYAFSPM 
901 FEVLGQFVFE VRGSSRIYNV DLGGKFQF 



Possible T cell epitope: 
903 VLGQFVFEV 
Possible B cell epitope: 
DKDKKGEKRKYRHKSG 
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Figure 13: Identification of T- and B- cell epitopes from open reading frame ORF: 
cpn100638 (SEQ ID NO: 14) 

1 MKSSLHWFLI SSSLALPLSL NFSAFAAWE INLGPTNSFS GPGTYTPPAQ 

51 TTNADGTIYN LTGDVSITNA GSPTALTASC FKETTGNLSF QGHGYQFLLQ 

101 NIDAGANCTF TNTAANKLLS FSGFSYLSLI QTTNATTGTG AIKSTGACSI 

151 QSNYSCYFGQ NFSNDNGGAL QGSSISLSLN PNLTFAKNKA TQKGGALYST 

201 GGITINNTLN SASFSENTAA NNGGAIYTEA SSFISSNKAI SFINNSVTAT 

251 SATGGAIYCS STSAPKPVLT LSDNGELNFI GNTAITSGGA lYTDNLVLSS 

301 GGPTLFKNNS GYDTAAPLGG AIAIADSGSL SLSALGGDIT FEGNTWKGA 

351 SSSQTTTRNS INIGNTNAKI VQLRASQGNT lYFYDPITTS ITAALSDALN 

401 LNGPDLAGMP AYQGTIVFSG EJCLSEAEAAE ADNLKSTIQQ PLTLAGGQLS 

451 LKSGVTLVAK SFSQSPGSTL LMDAGTTLET ADGITINNLV LNVDSLKETK 

501 KGTLKATQAS QTVTLSGSLS LVDPSGNVYE DVSWNNPQVF SCLTLTADDP 

551 ANIHITDLAA DPLEKNPIHW GYQGNWALSW QEDTATKSKA ATLTWTKTGY 

601 NPNPERRGTL VANTLWGSFV DVRSIQQLVA TKVRQSQETR GIWCEGISNF 

651 FHKDSTKINK GFRHISAGYV VGATTTLASD NLITAAFCQL FGKDRDHFIN 

701 KNRASAYAAS LHLQHLATLS SPSLLRYLPG SESEQPVLFD AQISYIYSKN 

751 TMKTYYTQAP KGESSWYNDG CALELASSLP HTALSHEGLF HAYFPFIKVE 

801 ASYIHQDSFK ERNTTLVRSF DSGDLINVSV PIGITFERFS RNERASYEAT 

851 VIYVADVYRK NPDCTTALLI NNTSWKTTGT NLSRQAGIGR AGIFYAFSPN 

901 LEVTSNLSME IRGSSRSYNA DLGGKFQF 



Possible T cell epitope: 
614 TLWGSFVDV 
Possible B cell epitope: 
595 WTKTGYNPNPERRG 
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Figure 14: Identification of T- and B- cell epitopes from open reading frame ORF: 
cpn100639(SEQIDNO:15) 

1 MKIPLHKLLI SSTLVTPILL SIATYGADAS LSPTDSFDGA GGSTFTPKST 

51 ADANGTNYVL SGNVYINDAG KGTALTGCCF TETTGDLTFT GKGYSFSFNT 

101 VDAGSNAGAA ASTTADKALI FTGFSNLSFI AAPGTTVASG KSTLSSAGAL 

151 NLTDNGTILF SQNVSNEANN NGGAITTKTL SISGNTSSIT FTSNSAKKLG 

201 GAIYSSAAAS ISGNTGQLVF MNNKGETGGG ALGFEASSSI TQNSSLFFSG 

251 NTATDAAGKG GAIYCEKTGE TPTLTISGNK SLTFAENSSV TQGGAICAHG 

301 LDLSAAGPTL FSNNRCGNTA AGKGGAIAIA DSGSLSLSAN QGDITFLGNT 
351 LTSTSAPTST RNAIYLGSSA KITNLRAAQG QSIYFYDPIA SNTTGASDVL 

401 TINQPDSNSP LDYSGTIVFS GEKLSADEAK AADNFTSILK QPLALTVSGTL 

451 ALKGNVELDV NGFTQTEGST LLMQPGTKLK ADTEAISLTK LWDLSALEG 

501 NKSVSIETAG ANKTITLTSP LVFQDSSGNF YESHTINQAF TQPLWFTAA 

551 TAASDIYIDA LLTSPVQTPE PHYGYQGHWE ATWADTSTAK SGTMTWVTTG 

601 YNPNPERRAS WPDSLWASF TDIRTLQQIM TSQANSIYQQ RGLWASGTAN 

651 FFHKDKSGTN QAFRHKSYGY IVGGSAEDFS ENIFSVAFCQ LFGKDKDLFI 

701 VENTSHNYLA SLYLQHRAFL GGLPMPSFGS ITDMLKDIPL ILNAQLSYSY 

751 TKNDMDTRYT SYPEAQGSWT NNSGALELGG SLALYLPKEA PFFQGYFPFL 

801 KFQAVYSRQQ NFKESGAEAR AFDDGDLVNC SIPVGIRLEK ISEDEKNNFE 

851 ISLAYIGDVY RKNPRSRTSL MVSGASWTSL CKNLARQAFL ASAGSHLTLS 
901 PHVELSGEAA YELRGSAHIY NVDCGLRYSF 



Possible T cell epitope : 

7 KLLISSTLV 
Possible B cell epitopes : 
839 EKISEDEKNNF 
860 YRKNPRSRT 
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Figure 15; Identification of T- and B- cell epitopes from open reading frame ORF^ 
cpn1 00708 (SEQ ID NO: 16) 

1 MLSSLIRDSF PLLILLPTFL AALGASVAGG VMGTYIWKR IVSISGSISH 
51 AILGGIGLTL WIQYKLHLSF FPMYGAIVGA IFLALCIGKI HLKYQEREDS 
101 LIAMIWSVGM AIGIIFISRL PTFNGELINF LFGNILWVTP SDLYSLGIFD 
151 LLVLGIWLC HTRFLALCFD ERYTALNHCS VQLWYFLLLV LTAITIVMLI 
201 YVMGTILMLS MLVLPVAIAC RFSYKMTRIM FISVLLNILC SFSGICIAYC 
251 LDFPVGPTIS LLMGLGYTAS LCVKKRYNPS TPSPVSPEIN TNV 



Possible T cell epitope: 
130 FLFGNILWV 
Possible B cell epitope: 
91 HLKYQEREDS 
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SEQUENCE LISTING 
<110> Connaught Laboratories Limited 

<120> Chlamydia antigens and corresponding DNA fragments and uses thereof 

<130> 77813-2 

<140> 

<141> 

<150> OS 60/110,427 
<150> US 60/110,438 
<150> OS 60/110,339 
<150> OS 60/110,428 
<150> OS 60/110,340 
<151> December 1, 1998 
<160> 16 

<170> Patent In Ver. 2.0 



<210> 1 
<211> 2950 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) . . (2884) 
<400> 1 

gataaaaatt cttgacagct gttttgtcat ctttaacttg atttacttat tttgtttcta 60 

tattgatgcg aatagttctc taaaaaacaa aagcattacc atg aag act teg att 115 

Met Lys Thr Ser He 
1 5 

cot tgg gtt tta gtt tec tec gtg tta get ttc tea tgt cac eta cag 163 
Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe Ser Cys His Leu Gin 
10 15 20 

tea eta get aac gag gaa ctt tta tea cet gat gat age ttt aat gga 211 
Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp Asp Ser Phe Asn Gly 
25 30 35 

aat ate gat tea gga acg ttt act cca aaa act tea gee aca aca tat 259 
Asn He Asp Ser Gly Thr Phe Thr Pro Lys Thr Ser Ala Thr Thr Tyr 
40 45 50 

tct eta aca gga gat gtc ttc ttt tae gag cet gga aaa gge act ccc 307 
Ser Leu Thr Gly Asp Val Phe Phe Tyr Glu Pro Gly Lys Gly Thr Pro 
55 60 65 

tta tct gae agt tgt ttt aag caa acc acg gac aat ctt acc ttc ttg 355 
Leu Ser Asp Ser Cys Phe Lys Gin Thr Thr Asp Asn Leu Thr Phe Leu 
70 75 80 85 

ggg aac ggt cat age tta acg ttt ggc ttt ata gat get ggc act cat 403 
Gly Asn Gly His Ser Leu Thr Phe Gly Phe He Asp Ala Gly Thr His 
90 95 100 
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gca ggt get get gca. tct aca aca gca aat aag aat ctt acc ttc tea 451 
Ala Gly Ala Ala Ala Ser Thr Thr Ala Asn Lys Asn Leu Thr Phe Ser 
105 110 115 

ggg ttt tec tta etg agt ttt gat tec tct ect age aca acg gtt act 499 
Gly Phe Ser Leu Leu Ser Phe Asp Ser Ser Pro Ser Thr Thr Val Thr 
120 125 130 

aca ggt eag gga acg ctt tec tea gca gga ggc gta aat tta gaa aat 547 
Thr Gly Gin Gly Thr Leu Ser Ser Ala Gly Gly Val Asn Leu Glu Asn 
135 140 145 

att cgt aaa ctt gta gtt get ggg aat ttt tct act gca gat ggt gga 595 
He Arg Lys Leu Val Val Ala Gly Asn Phe Ser Thr Ala Asp Gly Gly 
150 155 160 165 

get ate aaa gga geg tct ttc ctt tta act ggc act tct gga gat get 643 
Ala He Lys Gly Ala Ser Phe Leu Leu Thr Gly Thr Ser Gly Asp Ala 
170 175 180 

ctt ttt agt aac aac tct tea tea aca aag gga gga gca att get act 691 
Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly Gly Ala He Ala Thr 
185 190 195 

aca gca ggc get ege at a gea aat aac aca ggt tat gtt aga ttc eta 739 
Thr Ala Gly Ala Arg He Ala Asn Asn Thr Gly Tyr Val Arg Phe Leu 
200 205 210 

tct aac ata gcg tct acg tea gga ggc get ate gat gat gaa ggc acg 787 
Ser Asn He Ala Ser Thr Ser Gly Gly Ala He Asp Asp Glu Gly Thr 
215 220 225 

teg ata eta teg aac aac aaa ttt eta tat ttt gaa ggg aat gca gcg 835 
Ser He Leu Ser Asn Asn Lys Phe Leu Tyr Phe Glu Gly Asn Ala Ala 
230 235 240 245 

aaa act act ggc ggt gcg ate tgc aac ace aag gcg agt gga tct ect 883 
Lys Thr Thr Gly Gly Ala He Cys Asn Thr Lys Ala Ser Gly Ser Pro 
250 255 260 

gaa etg ata ate tct aac aat aag act etg ate ttt get tea aac gta 931 
Glu Leu He He Ser Asn Asn Lys Thr Leu He Phe Ala Ser Asn . Val 
265 270 275 

gca gaa aca age ggt ggc gee ate cat get aaa aag eta gee ctt tec 979 
Ala Glu Thr Ser Gly Gly Ala He His Ma Lys Lys Leu Ala Leu Ser 
280 285 290 

tct gga ggc ttt aca gag ttt eta cga aat aat gtc tea tea gca act 1027 
Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn Val Ser Ser Ala Thr 
295 300 305 

ect aag ggg ggt get ate age ate gat gee tea gga gag etc agt ctt 1075 
Pro Lys Gly Gly Ala He Ser He Asp Ala Ser Gly Glu Leu Ser Leu 
310 315 320 325 
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tct gca gag aca gga aac att acc ttt gta aga aat acc ctt aca aca 1123 
Ser Ala Glu Thr Gly Asn He Thr Phe Val Arg Asn Thr Leu Thr Thr 
330 335 340 

acc gga agt acc gat act cct aaa cgt aat gcg ate aac ata gga agt 1171 
Thr Gly Ser Thr Asp Thr Pro Lys Arg Asn Ala He Asn lie Gly Ser 
345 350 355 

aac ggg aaa ttc acg gaa tta egg get get aaa aat cat aca att ttc 1219 
Asn Gly Lys Phe Thr Glu Leu Arg Ala Ala Lys Asn His Thr He Phe 
360 365 370 

ttc tat gat ccc ate act tea gaa gga acc tea tea gac gta ttg aag 1267 
Phe Tyr Asp Pro He Thr Ser Glu Gly Thr Ser Ser Asp Val Leu Lys 
375 380 385 

ata aat aac ggc tct gcg gga get etc aat eca tat caa gga acg att 1315 
He Asn Asn Gly Ser Ala Gly Ala Leu Asn Pro Tyr Gin Gly Thr He 
390 395 400 405 

eta ttt tct gga gaa ace eta aca gca gat gaa ctt aaa gtt get gac 1363 
Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu Leu Lys Val Ala Asp 
410 415 420 

aat tta aaa tct tea ttc acg cag eca gtc tec eta tec gga gga aag 1411 
Asn Leu Lys Ser Ser Phe Thr Gin Pro Val Ser Leu Ser Gly Gly Lys 
425 430 435 

tta ttg eta caa aag gga gtc act tta gag age acg age ttc tct caa 1459 
Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser Thr Ser Phe Ser Gin 
440 445 450 

gag gee ggt tct etc etc ggc atg gat tea gga acg aca tta tea act 1507 
Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly Thr Thr Leu Ser Thr 
455 460 465 

aca get ggg agt att aca ate acg aac eta gga ate aat gtt gac tec 1555 
Thr Ala Gly Ser He Thr He Thr Asn Leu Gly He Asn Val Asp Ser 
470 475 480 485 

tta ggt ctt aag cag ccc gtc age eta aca gca aaa ggt get tea aat 1603 
Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala Lys Gly Ala Ser Asn 
490 495 500 

aaa gtg ate gta tct ggg aag etc aac ctg att gat att gaa ggg aac 1651 
Lys Val ile Val Ser Gly Lys Leu Asn Leu He Asp He Glu Gly Asn 
505 510 515 

att tat gaa agt eat atg ttc age cat gac cag etc ttc tct eta tta 1699 
He Tyr Glu Ser His Met Phe Ser His Asp Gin Leu Phe Ser Leu Leu 
520 525 530 

aaa ate acg gtt gat get gat gtt gat act aac gtt gac ate age age 1747 
Lys Ile Thr Val Asp Ala Asp Val Asp Thr Asn Val Asp lie Ser Ser 
535 540 545 
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ctt ate cct gtt cct get gag gat cct aat tea gaa tac gga tte caa 1795 
Leu lie Pro Val Pro Ala Glu Asp Pro Asn Ser Glu Tyr Gly Phe Gin 
550 555 560 565 

gga caa tgg aat gtt aat tgg act acg gat aca get aca aat aca aaa 1843 
Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr Ala Thr Asn Thr Lys 
570 575 580 

gag gee acg gea act tgg aec aaa aca gga ttt gtt ecc age ccc gaa 1891 
Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe Val Pro Ser Pro Glu 
585 590 595 

aga aaa tct geg tta gta tge aat aec eta tgg gga gtc ttt act gac 1939 
Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp Gly Val Phe Thr Asp 
600 605 610 

att egc tct ctg caa eag ctt gta gag ate gge gea act ggt atg gaa 1987 
He Arg Ser Leu Gin Gin Leu Val Glu He Gly Ala Thr Gly Met Glu 
615 620 625 

cac aaa caa ggt ttc tgg gtt tec tec atg acg aac ttc ctg cat aag 2035 
His Lys Gin Gly Phe Trp Val Ser Ser Met Thr Asn Phe Leu His Lys 
630 635 640 645 

act gga gat gaa aat egc aaa gge tte cgt cat ace tct gga ggc tac 2083 
Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His Thr Ser Gly Gly Tyr 
650 655 660 

gtc ate ggt gga agt get cae act cct aaa gac gac eta ttt aec ttt 2131 
Val He Gly Gly Ser Ala His Thr Pro Lys Asp Asp Leu Phe Thr Phe 
665 670 675 

gcg tte tge cat etc ttt get aga gac aaa gat tgt ttt ate get cac 2179 
Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp Cys Phe He Ala His 
680 685 690 

aac aac tct aga ace tac ggt gga act tta ttc tte aag cac tct cat 2227 
Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe Phe Lys His Ser His 
695 700 705 

aec eta caa ccc caa aac tat ttg aga tta gga aga gea aag ttt tct 2275 
Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly Arg Ala Lys Phe Ser 
710 715 720 725 

gaa tea get ata gaa aaa ttc cct agg gaa att ccc eta gee ttg gat 2323 
Glu Ser Ala He Glu Lys Phe Pro Arg Glu He Pro Leu Ala Leu Asp 
730 735 740 

gtc caa gtt teg tte age cat tea gac aac cgt atg gaa acg cac tat 2371 
Val Gin Val Ser Phe Ser His Ser Asp Asn Arg Met Glu Thr His Tyr 
745 750 755 

aec tea ttg eca gaa tec gaa ggt tct tgg age aac gag tgt ata get 2419 
Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser Asn Glu Cys He Ala 
760 765 770 
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ggt ggt ate ggc eta gac ctt ccc ttt gtt ctt tec aac oca cat cct 2467 
Gly Giy lie Gly Leu Asp Leu Pro Phe Val Leu Ser Asn Pro His Pro 
775 780 785 

ctt ttc aag acc ttc att cca cag atg aaa gtc gaa atg gtt tat gta 2515 
Leu Phe Lys Thr Phe lie Pro Gin Met Lys Val Glu Met Val Tyr Val 
790 795 BOO 805 

tea caa aat age ttc ttc gaa age tct agt gat ggc egt ggt ttt agt 2563 
Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp Gly Arg Gly Phe Ser 
810 815 820 

att gga agg etg ctt aac etc teg att cct gtg ggt gcg aaa ttc gtg 2611 
lie Gly Arg Leu Leu Asn Leu Ser lie Pro Val Gly Ala Lys Phe Val 
825 830 835 

cag ggg gat ate gga gat tec tac ace tat gat etc tea gga ttc ttt 2659 
Gin Gly Asp lie Gly Asp Ser Tyr Thr Tyr Asp Leu Ser Gly Phe Phe 
840 845 850 

gtt tec gat gtc tat egt aac aat ecc caa tct aca gcg act ctt gtg 2707 
Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser Thr Ala Thr Leu Val 
855 860 865 

atg age cea gac tet tgg aaa att egc ggt ggc aat ctt tea aga cag 2755 
Met Ser Pro Asp Ser Trp Lys lie Arg Gly Gly Asn Leu Ser Arg Gin 
870 875 880 885 

gca ttt tta etg agg ggt age aac aac tac gtc tac aac tee aat tgt 2803 
Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val Tyr Asn Ser Asn Cys 
890 895 900 

gag etc ttc gga cat tac get atg gaa etc egt gga tet tea agg aac 2851 
Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg Gly Ser Ser Arg Asn 
905 910 915 

tac aat gta gat gtt ggt ace aaa etc cga ttc tagattgeta aaactceeta 2904 
Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
920 925 

gttcttctag ggagttttct eatactttta gggaaatatt tgctat 2950 



<210> 2 
<211> 2784 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (1) - . (2784) 

<400> 2 

atgaagactt cgattccttg ggttttagtt tecteegtgt tagctttcte atgtcaccta 60 
cagtcactacf ctaacgagga aettttatca cetgatgata gctttaatgg aaatatcgat 120 
tcaggaacgt ttactccaaa aactteagcc acaacatatt ctctaaeagg agatgtcttc 180 
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ttttacgagc ctggaaaagg cactccctta tctgacagct gttttaagca aaccacggac 240 

aatcttacct tcttggggaa cggtcatagc ttaacgtttg gctttataga tgctggcact 300 

catgcaggtg ctgctgcatc tacaacagca aataagaatc ttaccttctc agggttttcc 360 

ttactgagtt ttgattcctc tcctagcaca acggttacta caggtcaggg aacgctttcc 420 

tcagcaggag gcgtaaattt agaaaatatt cgtaaacttg tagttgctgg gaatttttca 480 

ctgcagatgg tggagctatc aaaggagcgt ctttccttta actggcactt ctggagatgc 540 

tctttttagt aacaactctt catcaacaaa gggaggagca attgctacta cagcaggcgc 600 

tcgcatagca aataacacag gttatgttag attcctatct aacatagcgt ctacgtcagg 660 

aggcgctatc gatgatgaag gcacgtcgat actatcgaac aacaaatttc tatattttga 720 

agggaatgca gcgaaaacta ctggcggtgc gatctgcaac accaaggcga gtggatctcc 780 

tgaactgata atctctaaca ataagactct gatctttgct tcaaacgtag cagaaacaag 840 

cggtggcgcc atccatgcta aaaagctagc cctttcctct ggaggcttta cagagtttct 900 

acgaaataat gtctcatcag caactcctaa ggggggtgct atcagcatcg atgcctcagg 960 

agagctcagt ctttctgcag agacaggaaa cattaccttt gtaagaaata cccttacaac 1020 

aaccggaagt accgatactc ctaaacgtaa tgcgatcaac ataggaagta acgggaaatt 1080 

cacggaatta cgggctgcta aaaatcatac aattttcttc tatgatccca tcacttcaga 1140 

aggaacctca tcagacgtat tgaagataaa taacggctct gcgggagctc tcaatccata 1200 

tcaaggaacg attctatttt ctggagaaac cctaacagca gatgaactta aagttgctga 1260 

caatttaaaa tcttcattca cgcagccagt ctccctatcc ggaggaaagt tattgctaca 1320 

aaagggagtc actttagaga gcacgagctt ctctcaagag gccggttctc tcctcggcat 1380 

ggattcagga acgacattat caactacagc tgggagtatt acaatcacga acctaggaat 1440 

caatgttgac tccttaggtc ttaagcagcc cgtcagccta acagcaaaag gtgcttcaaa 1500 

taaagtgatc gtatctggga agctcaacct gattgatatt gaagggaaca tttatgaaag 1560 

tcatatgttc agccatgacc agctcttctc tctattaaaa atcacggttg atgctgatgt 1620 

tgatactaac gttgacatca gcagccttat ccctgttcct gctgaggatc ctaattcaga 1680 

atacggattc caaggacaat ggaatgttaa ttggactacg gatacagcta caaatacaaa 1740 

agaggccacg gcaacttgga ccaaaacagg atttgttccc agccccgaaa gaaaatctgc 1800 

gttagtatgc aataccctat ggggagtctt tactgacatt cgctctctgc aacagcttgt 1860 

agagatcggc gcaactggta tggaacacaa acaaggtttc tgggtttcct ccatgacgaa 1920 

cttcctgcat aagactggag atgaaaatcg caaaggcttc cgtcatacct ctggaggcta 1980 

cgtcatcggt ggaagtgctc acactcctaa agacgaccta tttacctttg cgttctgcca 2040 

tctctttgct agagacaaag attgttttat atcgctcaca acaactctag aacctacggt 2100 

ggaactttat tcttcaagca ctctcatacc ctacaacccc aaaactattt gagattagga 2160 

agagcaaagt tttctgaatc agctatagaa aaattcccta gggaaattcc cctagccttg 2220 

gatgtccaag tttcgttcag ccattcagac aaccgtatgg aaacgcacta tacctcattg 2280 

ccagaatccg aaggttcttg gagcaacgag tgtatagctg gtggtatcgg cctagacctt 2340 

ccttttgttc tttccaaccc acatcctctt ttcaagacct tcattccaca gatgaaagtc 2400 

gaaatggttt atgtatcaca aaatagcttc ttcgaaagct ctagtgatgg ccgtggtttt 2460 

agtattggaa ggctgcttaa cctctcgatt cctgtgggtg cgaaattcgt gcagggggat 2520 

atcggagatt cctacaccta tgatctctca ggattctttg tttccgatgt ctatcgtaac 2580 

aatccccaat ctacagcgac tcttgtgatg agcccagact cttggaaaat tcgcggtggc 2640 

aatctttcaa gacaggcatt tttactgagg ggtagcaaca actacgtcta caactccaat 2700 

tgtgagctct tcggacatta cgctatggaa ctccgtggat cttcaaggaa ctacaatgta 2760 

gatgttggta ccaaactccg attc 2784 



<210> 3 
<211> 2950 
<212> DNA 

<213> Chlamydia pneumoniae 
<220> 
I <221> CDS 

<222> (101) . . (2884) 
<400> 3 
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agttaaagat gacaaaacag ctgtcaagaa tttttatctt gactctctga gttttctatt 60 

ttatatgacg caagtaagaa tttaataata aagtgggttt atg aaa teg caa ttt 115 

Met Lys Ser Gin Phe 
1 5 

tec tgg tta gtg etc tct teg aca ttg gca tgt ttt act agt tgt tec 163 
Ser Trp Leu Val Leu Ser Ser Thr Leu Ala Cys Phe Thr Ser Cys Ser 
10 15 20 

act gtt ttt get gea act get gaa aat ata ggc ecc tct gat age ttt 211 
Thr Val Phe Ala Ala Thr Ala Glu Asn lie Gly Pro Ser Asp Ser Phe 
25 30 35 

gac gga agt act aae aca ggc ace tat act cet aaa aat aeg act act 259 
Asp Gly Ser Thr Asn Thr Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr 
40 45 50 

gga ata gac tat act ctg aca gga gat ata act ctg caa aac ctt ggg 307 
Gly lie Asp Tyr Thr Leu Thr Gly Asp He Thr Leu Gin Asn Leu Gly 
55 60 65 

gat teg gca get tta acg aag ggt tgt ttt tct gac act acg gaa tct 355 
Asp Ser Ala Ala Leu Thr Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser 
70 75 80 85 

tta age ttt gcc ggt aag ggg tac tea ctt tct ttt tta aat att aag 403 
Leu Ser Phe Ala Gly Lys Gly Tyr Ser Leu Ser Phe Leu Asn He Lys 
90 95 100 

tct agt get gaa ggc gca gcc ctt tct gtt aca act gat aaa aat ctg 451 
Ser Ser Ala Glu Gly Ala Ala Leu Ser Val Thr Thr Asp Lys Asn Leu 
105 110 115 

teg eta aca gga ttt teg agt ctt act ttc tta gcg gcc cea tea teg 499 
Ser Leu Thr Gly Phe Ser Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser 
120 125 130 

gta ate aca acc ecc tea gga aaa ggt gca gtt aaa tgt gga ggg gat 547 
Val He Thr Thr Pro Ser Gly Lys Gly Ala Val Lys Cys Gly Gly Asp 
135 140 . 145 

ctt aca ttt gat aac aat gga act att tta ttt aaa caa gat tac tgt 595 
Leu Thr Phe Asp Asn Asn Gly Thr He Leu Phe Lys Gin Asp Tyr Cys 
150 155 160 165 

gag gaa aat ggc gga gcc att tct acc aag aat ctt tct 'ttg aaa aac 643 
Glu Glu Asn Gly Gly Ala He Ser Thr Lys Asn Leu Ser Leu Lys Asn 
170 175 180 

age aeg gga teg att tct ttt gaa ggg aat aaa teg age gea aca ggg 691 
Ser Thr Gly Ser He Ser Phe Glu Gly Asn Lys Ser Ser Ala Thr Gly 
185 190 195 

aaa aaa ggt ggg get att tgt get act ggt act gta gat att aca aat 739 
Lys Lys Gly Gly Ala He Cys Ala Thr Gly Thr Val Asp He Thr Asn 
200 205 210 
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aat acq get cct acc etc tte teg aac aat att get gaa get gca ggt 787 
Asn Thr Ala Pro Thr Leu Phe Ser Asn Asn He Ala Glu Ala Ala Gly 
215 220 225 

gga get ata aat age aca gga aac tgt aca att aca ggg aat acg tet 835 
Gly Ala lie Asn Ser Thr Gly Asn Cys Thr lie Thr Gly Asn Thr Ser 
230 235 240 245 

ett gta ttt tet gaa aat agt gtg aca gcg acc gca gga aat gga gga 883 
Leu Val Phe Ser Glu Asn Ser Val Thr Ala Thr Ala Gly Asn Gly Gly 
250 255 260 

get ett tet gga gat gee gat gtt acc ata tet ggg aat cag agt gta 931 
Ala Leu Ser Gly Asp Ala Asp Val Thr Hie Ser Gly Asn Gin Ser Val 
265 270 275 

act tte tea gga aac caa get gta get aat gge gga gee att tat get 979 
Thr Phe Ser Gly Asn Gin Ala Val Ala Asn Gly Gly Ala He Tyr Ala 
280 285 290 

aag aag ett aca ctg get tee ggg ggg ggg ggg ggg aat cec ttt tet 1027 
Lys Lys Leu Thr Leu Ala Ser Gly Gly Gly Gly Gly Asn Pro Phe Ser 
295 300 305 

aac aat ata gte caa ggt ace act gca ggt aat ggt gga gee att tet 1075 
Asn Asn He Val Gin Gly Thr Thr Ala Gly Asn Gly Gly Ala He Ser 
310 315 320 325 

ata etg gca get gga gag tgt agt ett tte age gaa gca ggg gac cat 1123 
He Leu Ala Ala Gly Glu Cys Ser Leu Phe Ser Glu Ala Gly Asp His 
330 335 340 

tac ett aat ggg aat gee att gtt gea act aca cea caa act aca aaa 1171 
Tyr Leu Asn Gly Asn Ala He Val Ala Thr Thr Pro Gin Thr Thr Lys 
345 350 355 

aga aat tet att gac ata gga tet act gge aaa gat cae gaa tta egt 1219 
Arg Asn Ser He Asp He Gly Ser Thr Gly Lys Asp His Glu Leu Arg 
360 365 370 

gca ata tet ggg cat age ate ttt tte tac gat ccg att act get aat 1267 
Ala He Ser Gly His Ser He Phe Phe Tyr Asp Pro He Thr Ala Asn 
375 380 385 

acg get gcg gat tet aca gat act tta aat etc aat aag get gat gea 1315 
Thr Ala Ala Asp Ser Thr Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala 
390 395 400 405 

ggt aat agt aca gat tat agt ggg teg att gtt ttt tet ggt gaa aag 1363 
Gly Asn Ser Thr Asp Tyr Ser Gly Ser He Val Phe Ser Gly Glu Lys 
410 415 420 

etc tet gaa gat gaa gea aaa gtt gca gac aac etc act tet acg ctg 1411 
Leu Ser Glu Asp Glu Ala Lys Val Ala Asp Asn Leu Thr Ser Thr Leu 
425 430 435 



8 



SUBSTITUTE SHEET (RULE 26) 



wo 00)32794 



PCT/CA99/01147 



aag cag cct gta act eta act gca gga aat tta gta ctt aaa cgt ggt 1459 

Lys Gin Pro Val Thr Leu Thr Ala Gly Asn Leu Val Leu Lys Arg Gly 
440 445 450 

gtc act etc gat acg aaa ggc ttt act cag acc gcg ggt tec tct gtt 1507 

Val Thr Leu Asp Thr Lys Gly Phe Thr Gin Thr Ala Gly Ser Ser Val 
455 460 465 

att atg gat gcg ggc aca acg tta aaa gca agt aca gag gag gtc act 1555 

lie Met Asp Ala Gly Thr Thr Leu Lys Ala Ser Thr Glu Glu Val Thr 

470 475 480 485 

tta aca ggt ctt tec att cct gta gac tct tta ggc gag ggt aag aaa 1603 

Leu Thr Gly Leu Ser lie Pro Val Asp Ser Leu Gly Glu Gly Lys Lys 
490 495 . 500 

gtt gta att get get tct gca gca agt aaa aat gta gee ctt agt ggt 1651 

Val Val lie Ala Ala Ser Ala Ala Ser Lys Asn Val Ala Leu Ser Gly 
505 510 515 

ccg att ctt ctt ttg gat aac caa ggg aat get tat gaa aat cac gac 1699 

Pro lie Leu Leu Leu Asp Asn Gin Gly Asn Ala Tyr Glu Asn His Asp 
520 525 530 

tta gga aaa act caa gac ttt tea ttt gtg cag etc tct get ctg ggt 1747 

Leu Gly Lys Thr Gin Asp Phe Ser Phe Val Gin Leu Ser Ala Leu Gly 
535 540 545 

act gca aca act aca gat gtt cca gcg gtt cct aca gta gca act cct 1795 

Thr Ala Thr Thr Thr Asp Val Pro Ala Val Pro Thr Val Ala Thr Pro 

550 555 560 565 

acg cac tat ggg tat caa ggt act tgg gga atg act tgg gtt gat gat 1843 

Thr His Tyr Gly Tyr Gin Gly Thr Trp Gly Met Thr Trp Val Asp Asp 
570 575 580 

ace gca age act cea aag act aag aca gcg aca tta get tgg ace aat 1891 

Thr Ala Ser Thr Pro Lys Thr Lys Thr Ala Thr Leu Ala Trp Thr Asn 
585 590 595 

aca ggc tac ctt ccg aat cct gag cgt caa gga cct tta gtt cct aat 1939 

Thr Gly Tyr Leu Pro Asn Pro Glu Arg Gin Gly Pro Leu Val Pro Asn 
600 605 610 

age ctt tgg gga tct ttt tea gac ate caa gcg att caa ggt gtc ata 1987 

Ser Leu Trp Gly Ser Phe Ser Asp lie Gin Ala lie Gin Gly Val He 
615 620 625 

gag aga agt get ttg act ett tgt tea gat cga ggc tte tgg get gcg 2035 

Glu Arg Ser Ala Leu Thr Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala 

630 635 640 645 

gga gtc gee aat tte tta gat aaa gat aag aaa ggg gaa aaa cge aaa 2083 

Gly Val Ala Asn Phe Leu Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys 
650 655 660 
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tac cgt cat aaa tct ggt gga tat get ate gga ggt gca gcg caa act 2131 
Tyr Arg His Lys Ser Gly Gly Tyr Ala lie Gly Gly Ala Ala Gin Thr 
665 670 675 

tgt tct gaa aac tta att age ttt gee ttt tgc caa etc ttt ggt age 2179 
Cys Ser Glu Asn Leu lie Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser 
680 685 690 

gat aaa gat ttc tta gtc get aaa aat cat act gat ace tat gca gga 2227 
Asp Lys Asp Phe Leu Val Ala Lys Asn His Thr Asp Thr Tyr Ala Gly 
695 700 705 

gee ttc tat ate caa cac att aca gaa tgt agt ggg ttc ata ggt tgt 2275 
Ala Phe Tyr lie Gin His lie Thr Glu Cys Ser Gly Phe lie Gly Cys 
710 715 720 725 

etc tta gat aaa ctt cct ggc tct tgg agt cat aaa ccc etc gtt tta 2323 
Leu Leu Asp Lys Leu Pro Gly Ser Trp Ser His Lys Pro Leu Val Leu 
730 735 740 

gaa ggg cag etc get tat age cac gtc agt aat gat ctg aag aca aag 2371 
Glu Gly Gin Leu Ala Tyr Ser His Val Ser Asn Asp Leu Lys Thr Lys 
745 750 755 

tat act gcg tat cct gag gtg aaa ggt tct tgg ggg aat aat get ttt 2419 
Tyr Thr Ala Tyr Pro Glu Val Lys Gly Ser Trp Gly Asn Asn TVla Phe 
760 765 770 

aac atg atg ttg gga get tct tct eat tct tat cct gaa tac ctg cat 2467 
Asn Met Met Leu Gly Ala Ser Ser His Ser Tyr Pro Glu Tyr Leu His 
775 780 785 

tgt ttt gat ace tat get cea tac ate aaa ctg aat ctg acc tat ata 2515 
Cys Phe Asp Thr Tyr Ala Pro Tyr He Lys Leu Asn Leu Thr Tyr He 
790 795 800 805 

cgt cag gac age ttc teg gag aaa ggt aca gaa gga aga tct ttt gat 2563 
Arg Gin Asp Ser Phe Ser Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp 
810 815 820 

gac age aac etc ttc aat tta tct ttg cct ata ggg gtg aag ttt gag 2611 
Asp Ser Asn Leu Phe Asn Leu Ser Leu Pro He Gly Val Lys Phe Glu 
825 830 835 

aag ttc tct gat tgt aat gac ttt tct tat gat ctg act tta tec tat 2659 
Lys Phe Ser Asp Cys Asn Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tyr 
840 845 850 

gtt cct gat ctt ate cgc aat gat ccc aaa tgc act aca gca ctt gta 2707 
Val Pro Asp Leu He Arg Asn Asp Pro Lys Cys Thr Thr Ala Leu Val 
855 860 865 

ate age gga gcc tct tgg gaa act tat gcc aat aac tta gca cga cag 2755 
He Ser Gly Ala Ser Trp Glu Thr Tyr Ala Asn Asn Leu Ala Arg Gin 
870 875 880 885 
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gcc ttg caa gtg cgt gca ggc agt cac tac gcc ttc tct cct atg ttt 2803 
Ala Leu Gin Val Arg Ala Gly Ser His Tyr Ala Phe Ser Pro Met Phe 
890 895 900 

gaa gtg etc ggc cag ttt gtc ttt gaa gtt cgt gga tec tea egg att 2851 
Glu Val Leu Gly Gin Phe Val Phe Glu Val Arg Gly Ser Ser Arg lie 
905 910 915 

tat aat gta gat ctt ggg ggt aag tte caa ttc taggagcgtc tctcatgtct 2904 
Tyr Asn Val Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 

eagaaattct gagagagatc gcatttagga ttttettaaa cacgac 2950 



<210>4 

<211>2950 

<212>DNA 

<213>Chlainydiapneuinoniae 

<220> 

<221>CDS 

<222>(101) . . (2784) 
<400>4 

atgaaatcgc aattttectg gttagtgctc tettcgacat tggcatgttt tactagttgt 60 

tccactgttt ttgctgcaae tgctgaaaat ataggcceet ctgatagett tgacggaagt 120 

actaacacag gcacctatac tectaaaaat aegactaetg gaatagacta tactctgaca 180 

ggagatataa ctctgcaaaa ccttggggat tcggeagctt taacgaaggg ttgtttttet 240 

gacactaegg aatctttaag ctttgecggt aaggggtaet cactttcttt tttaaatatt 300 

aagtctagtg ctgaaggcgc agccctttet gttacaactg ataaaaatct gtcgctaaca 360 

ggattttcga gtettacttt cttagcggcc ceatcatcgg taatcacaac cecetcagga 420 

aaaggtgcag ttaaatgtgg aggggatett aeatttgata acaatggaac tattttattt 480 

aaacaagatt actgtgagga aaatggcgga gecatttcta ccaagaatet ttctttgaaa 540 

aacagcacgg gatcgatttc ttttgaaggg aataaatcga gcgcaacagg gaaaaaaggt 600 

ggggctattt gtgctactgg tactgtagat attacaaata atacggctcc taccctcttc 660 

tcgaacaata ttgctgaagc tgeaggtgga gctataaata geaeaggaaa ctgtacaatt 720 

acagggaata cgtctcttgt attttctgaa aatagtgtga cagegaccgc aggaaatgga 780 

ggagctcttt ctggagatge cgatgttacc atatctggga ateagagtgt aactttctca 840 

ggaaaccaag ctgtagctaa tggcggagcc atttatgcta agaagettae aetggcttec 900 

9999999999 gggggaatcc ettttctaac aatatagtee aaggtaceae tgcaggtaat 960 

ggtggagcca tttctataet ggeagctgga gagtgtagte ttttcagcga agcaggggae 1020 

cattacctta atgggaatgc cattgttgea actaeaecae aaaetacaaa aagaaattca 1080 

ttgacatagg atctactggc aaagatcacg attacgtgca atatctgggc atagcatctt 1140 

tttctacgat ccgattactg ctaatacggc tgcggattet aeagatactt taaatctcaa 1200 

taaggctgat geaggtaata gtacagatta tagtgggteg attgtttttt ctggtgaaaa 1260 

gctctctgaa gatgaagcaa aagttgcaga caacctcaet tetacgctga agcagcctgt 1320 

aactctaact gcaggaaatt tagtacttaa acgtggtgte aetetegata egaaaggctt 1380 

tactcagacc gcgggttcct ct gtt at tat ggatgcgggc acaacgttaa aagcaagtae 1440 

agaggaggtc actttaacag gtctttecat tcctgtagac tctttaggcg agggtaagaa 1500 

agttgtaatt getgettctg eagcaagtaa aaatgtagcc ettagtggtc cgattcttet 1560 

tttggataac eaagggaatg cttatgaaaa tcacgactta ggaaaaactc aagacttttc 1620 

atttgtgcag ctctctgctc tgggtaetgc aaeaaetaca gatgttccag cggttcetac 1680 

agtagcaact cctacgcact atgggtatca aggtacttgg ggaatgactt gggttgatga 1740 

tacegcaagc aetccaaaga etaagacagc gacattagct tggaceaatc aggctacctt 1800 

ccgaatcctg agcgtcaagg acctttagtt cctaatagcc tttggggatc tttttcagac 1860 
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atccaagcga ttcaaggtgt catagagaga agtgctttga ctctttgttc agatcgaggc 19^0 

ttctgggctg cgggagtcgc caatttctta gataaagata agaaagggga aaaacgcaaa 1980 

taccgtcata aatctggtgg atatgctatc ggaggtgcag cgcaaacttg ttctgaaaac 2040 

ttaattagct ttgccttttg ccaactcttt ggtagcgata aagatttctt agtcgctaaa 2100 

aatcatactg atacctatgc aggagccttc tatatccaac acattacaga atgtagtggg 2160 

ttcataggtt gtctcttaga taaacttcct ggctcttgga gtcataaacc cctcgtttta 2220 

gaagggcagc tcgcttatag ccacgtcagt aatgatctga agacaaagta tactgcgtat 2280 

cctgaggtga aaggttcttg ggggaataat gcttttaaca tgatgttggg agcttcttct 2340 

cattcttatc ctgaatacct gcattgtttt gatacctatg ctccatacat caaactgaat 2400 

ctgacctata tacgtcagga cagcttctcg gagaaagtac agaaggaaga tcttttgatg 2460 

acagcaacct cttcaattta tctttgccta taggggtgaa gtttgagaag ttctctgatt 2520 

gtaatgactt ttcttatgat ctgactttat cctatgttcc tgatcttatc cgcaatgatc 2580 

ccaaatgcac tacagcactt gtaatcagcg gagcccactt cttgggaaac ttatgccaat 2640 

aacttagcac gacaggcctt gcaagtgcgt gcaggcagtc actacgcctt ctctcctatg 2700 

tttgaagtgc tcggccagtt tgtctttgaa gttcgtggat cctcacggat ttataatgta 2760 

gatcttgggg gtaagttcca attc 2784 



<210> 5 
<211> 2950 
<212> DNA 

<213> Chlamydia pneiamoniae 

<220> 

<221> CDS 

<222> (101) . . (2884) 
<400> 5 

tgtagattct taacttacta gtctctcctt tcctcttgct ttctttaatt tattgcagta 60 

tgtggtgaaa taatttgtta aaccacctat agccctctac atg aaa tec tct ctt 115 

Met Lys Ser Ser Leu 
1 5 

cat tgg ttt tta ate . teg tea tct tta gca ett ccc ttg tea eta aat 163 
His Trp Phe Leu He Ser Ser Ser Leu Ala Leu Pro Leu Ser Leu Asn 
10 15 20 

ttc tet geg ttt get get gtt gtt gaa ate aat eta gga cet ace aat 211 
Phe Ser Ala Phe Ala Ala Val Val Glu He Asn Leu Gly Pro Thr Asn 
25 30 35 

age tte tet gga eca gga aee tae act eet eea gee caa aea aea aat 259 
Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro Ala Gin Thr Thr Asn 
40 45 50 



gca gat gga act ate tat aat eta aea ggg gat gtc tea ate ace aat 
Ala Asp Gly Thr He Tyr Asn Leu Thr Gly Asp Val Ser He Thr Asn 
55 60 65 



307 



gea gga tet eeg aea get eta aee get tec tge ttt aaa gaa act act 355 
Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys Phe Lys Glu Thr Thr 
70 75 80 85 
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ggg aat ctt tct ttc caa ggc cac ggc tac caa ttt etc eta caa aat 403 
Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin Phe Leu Leu Gin Asn 
90 95 100 

ate gat geg gga gcg aac tgt acc ttt ace aat aca get gca aat aag 451 
lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn Thr Ala Ala Asn Lys 
105 110 115 

ctt etc tec ttt tea gga ttc tec tat ttg tea eta ata caa aec acg 499 
Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser Leu lie Gin Thr Thr 
120 125 130 

aat get aec aca gga aca gga gcc ate aag tec aca gga get tgt tct 547 
Asn Ala Thr Thr Gly Thr Gly Ala He Lys Ser Thr Gly Ala Cys Ser 
135 140 145 

att cag teg aae tat agt tge tac ttt ggc caa aac ttt tct aat gac 595 
He Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin Asn Phe Ser TVsn Asp 
150 155 160 165 

aat gga ggc gcc etc caa ggc age tct ate agt eta teg eta aac ecc 643 
Asn Gly Gly Ala Leu Gin Gly Ser Ser He Ser Leu Ser Leu Asn Pro 
170 175 180 

aac eta acg ttt gcc aaa aac aaa gca acg caa aaa ggg ggt gcc etc 691 
Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin Lys Gly Gly Ala Leu 
185 ' 190 195 

tat tee acg gga ggg att aca att aac aat acg tta aac tea gca tea 739 
Tyr Ser Thr Gly Gly He Thr He Asn Asn Thr Leu Asn Ser Ala Ser 
200 205 210 

ttt tct gaa aat acc gcg gcg aac aat ggc gga gcc att tac acg gaa 787 
Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly Ala He Tyr Thr Glu 
215 220 225 

get age agt ttt att age age aac aaa gea att age ttt ata aac aat 835 
Ala Ser Ser Phe He Ser Ser Asn Lys Ala He Ser Phe He Asn Asn 
230 235 240 245 

agt gtg acc gca acc tea get aca ggg gga gcc att tac tgt agt agt 883 
Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala He Tyr Cys Ser Ser 
250 255 260 

aca tea gee ecc aaa cea gtc tta act eta tea gac aac ggg gaa ctg 931 
Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser Asp Asn Gly Glu Leu 
265 270 275 

aac ttt ata gga aat aca gca att act agt ggt ggg gcg att tat act 979 
Asn Phe He Gly Asn Thr Ala He Thr Ser Gly Gly Ala He Tyr Thr 
280 285 290 

gac aat eta gtt ctt tct tct gga gga cet acg ctt ttt aaa aac aac 1027 
Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr Leu Phe Lys Asn Asn 
295 300 305 
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tct ggc tat gat act. gca get ccc tta gga gga gca att gcg att get 1075 

Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly Ala He Ala He Ala 

310 315 320 325 

gac tct gga. tct ttg agt ctt teg get ctt ggt gga gac ate act ttt 1123 

Asp Ser Gly Ser Leu Ser Leu Ser Ala Leu Gly Gly Asp He Thr Phe 

330 335 340 

gaa gga aac aca gta gtc aaa gga get tct teg agt cag acc act acc 1171 

Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser Ser Gin Thr Thr Thr 

345 350 355 

aga aat tct att aac ate gga aac acc aat get aag att gta cag ctg 1219 

Arg Asn Ser He Asn He Gly Asn Thr Asn Ala Lys He Val Gin Leu 
360 365 370 

cga gee tct caa ggc aat act ate tac tte tat gat ect ata aca act 1267 

Arg Ala Ser Gin Gly Asn Thr He Tyr Phe Tyr Asp Pro He Thr Thr 
375 380 385 

age ate act gca get etc tea gat get eta aac tta aat ggt cct gac 1315 

Ser He Thr Ala Ala Leu Ser Asp Ala Leu Asn Leu Asn Gly Pro Asp 

390 395 400 405 

ctt gca ggg aat ect gca tat caa gga acc ate gta ttt tct gga gag 1363 

Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr He Val Phe Ser Gly Glu 

410 415 420 

aag etc teg gaa gca gaa get gca gaa get gat aat etc aaa tct aca 1411 

Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp Asn Leu Lys Ser Thr 

425 430 435 

att cag caa ect eta act ctt gcg gga ggg caa etc tct ctt aaa tea 1459 

He Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin Leu Ser Leu Lys Ser 
440 445 450 

gga gtc act eta gtt get aag tec ttt teg caa tct ccg ggc tct acc 1507 

Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin Ser Pro Gly Ser Thr 
455 460 465 

etc etc atg gat gca ggg acc aca tta gaa ace get gat ggg ate act 1555 

Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr Ala Asp Gly He Thr 

470 475 480 485 

ate aat aat ctt gtt etc aat gta gat tec tta aaa gag ace aag aag 1603 

He Asn Asn Leu Val Leu Asn Val Asp Ser Leu Lys Glu Thr Lys Lys 

490 495 500 

ggc aeg eta aaa gca aca caa gca agt cag aca gtc act tta tct gga 1651 

Gly Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr Val Thr Leu Ser Gly 

505 510 515 

teg etc tct ett gta gat ect tct gga aat gtc tac gaa gat gtc tct 1699 
Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val Tyr Glu Asp Val Ser 
520 525 530 
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tgg aat aac cct caa gtc ttt tct tgt etc act ctt act get gac gac 1747 
Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr Leu Thr Ala Asp Asp 
535 540 545 

ccc gcg aat att cac ate aca gac tta get get gat ccc eta gaa aaa 1795 
Pro Ala Asn lie His He Thr Asp Leu Ala Ala Asp Pro Leu Glu Lys 
550 555 560 565 

aat cct ate cat tgg gga tac caa ggg aat tgg gca tta tct tgg caa 1843 
Asn Pro He His Trp Gly Tyr Gin Gly Asn Trp Ala Leii Ser Trp Gin 
570 575 580 

gag gat act gcg act aaa tee aaa gca gcg act ctt acc tgg aca aaa 1891 
Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr Leu Thr Trp Thr Lys 
585 590 595 

aca gga tac aat ecg aat cct gag cgt cgt gga acc tta gtt get aae 1939 
Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly Thr Leu Val Ala Asn 
600 605 610 

acg eta tgg gga tec ttt gtt gat gtg cgc tec ata caa cag ctt gta 1987 
Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser He Gin Gin Leu Val 
615 620 625 

gcc act aaa gta cgc caa tct caa gaa act cgc ggc ate tgg tgt gaa 2035 
Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg Gly He Trp Cys Glu 
630 635 640 645 

ggg ate teg aac ttc ttc cat aaa gat age acg aag ata aat aaa ggt 2083 
Gly He Ser Asn Phe Phe His Lys Asp Ser Thr Lys He Asn Lys Gly 
650 655 660 

ttt cgc cac ata agt gca ggt tat gtt gta gga gcg act aca aca tta 2131 
Phe Arg His He Ser Ala Gly Tyr Val Val Gly Ala Thr Thr Thr Leu 
665 670 675 

get tct gat aat ctt ate act gca gee ttc tgc caa tta ttc ggg aaa 2179 
Ala Ser Asp Asn Leu He Thr Ala Ala Phe Cys Gin Leu Phe Gly Lys 
680 685 690 

gat aga gat cac ttt ata aat aaa aat aga get tct gee tat gca get 2227 
Asp Arg Asp His Phe He Asn Lys Asn Arg Ala Ser Ala Tyr Ala Ala 
695 700 705 

tct etc cat etc cag eat eta gcg acc ttg tct tct eca age ttg tta 2275 
Ser Leu His Leu Gin His Leu Ala Thr Leu Ser Ser Pro Ser Leu Leu 
710 715 720 725 

cgc tac ctt cct gga tct gaa agt gag cag cct gtc etc ttt gat get 2323 
Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro Val Leu Phe Asp Ala 
730 735 740 

cag ate age tat ate tat agt aaa aat act atg aaa acc tat tac acc 2371 
Gin He Ser Tyr He Tyr Ser Lys Asn Thr Met Lys Thr Tyr Tyr Thr 
745 750 755 
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caa gca cca aag gga gag age teg tgg tat aat gac ggt tgc get ctg 2419 
Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn Asp Gly Cys Ala Leu 
760 765 770 

gaa ctt gcg age tec eta cea eac act get tta age cat gag ggt etc 2467 
Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu Ser His Glu Gly Leu 
775 780 785 

ttc cac geg tat ttt ect ttc ate aaa gta gaa get teg tae ata cae 2515 
Phe His Ala Tyr Phe Pro Phe lie Lys Val Glu Ala Ser Tyr lie His 
790 795 800 805 

caa gat age ttc aaa gaa cgt aat act ace ttg gta cga tet ttc gat 2563 
Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu Val Arg Ser Phe Asp 
810 815 820 

age ggt gat tta att aac gtc tct gtg ect att gga att ace ttc gag 2611 
Ser Gly Asp Leu He Asn Val Ser Val Pro lie Gly He Thr Phe Glu 
825 830 835 

aga ttc teg aga aac gag cgt gcg tct tae gaa get act gtc ate tae 2659 
Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu Ala Thr Val He Tyr 
840 845 850 

gtt gee gat gtc tat cgt aag aat ect gac tgc acg aca get etc eta 2707 
Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys Thr Thr Ala Leu Leu 
855 860 865 

ate aac aat acc teg tgg aaa act aca gga acg aat etc tea aga caa 2755 
He Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr Asn Leu Ser Arg Gin 
870 875 880 885 

get ggt ate gga aga gca ggg ate ttt tat gee ttc tet cca aat ctt 2803 
Ala Gly He Gly Arg Ala Gly He Phe Tyr Ala Phe Ser Pro Asn Leu 
890 895 900 

gag gtc aca agt aac eta tct atg gaa att cgt gga tet tea cgc age 2851 
Glu Val Thr Ser Asn Leu Ser Met Glu He Arg Gly Ser Ser Arg Ser 
905 910 915 

tae aat gca gat ctt gga ggt aag ttc cag ttc taaaagcgtt cetgatccet 2904 
Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
920 925 

tagaaattet aagagatcct gagtgtatct agggaettct caaaga 2950 



<210>6 

<211>2784 

<212>DNA 

<2 13>Chlamydiapneuinoniae 

<220> 

<221>CDS 

<222>{1) (2784) 

<400>6 
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atgaaatcct ctcttcattg gtttttaatc tcgtcatctt tagcacttcc cttgtcacta 60 

aatttctctg cgtttgctgc tgttgttgaa atcaatctag gacctaccaa tagcttctct 120 

ggaccaggaa cctacactcc tccagcccaa acaacaaatg cagatggaac tatctataat 180 

ctaacagggg atgtctcaat caccaatgca ggatctccga cagctctaac cgcttcctgc 240 

tttaaagaaa ctactgggaa tctttctttc caaggccacg gctaccaatt tctcctacaa 300 

aatatcgatg cgggagcgaa ctgtaccttt accaatacag ctgcaaataa gcttctctcc 360 

ttttcaggat tctcctattt gtcactaata caaaccacga atgctaccac aggaacagga 420 

gccatcaagt ccacaggagc ttgttctatt cagtcgaact atagttgcta ctttggccaa 480 

aacttttcta atgacaatgg aggcgccctc caaggcagct ctatcagtct atcgctaaac 540 

cccaacctaa cgtttgccaa aaacaaagca acgcaaaaag ggggtgccct ctattccacg 600 

ggagggatta caattaacaa tacgttaaac tcagcatcat tttctgaaaa taccgcggcg 660 

aacaatggcg gagccattta cacggaagct agcagtttta ttagcagcaa caaagcaatt 720 

agctttataa acaatagtgt gaccgcaacc tcagctacag ggggagccat ttactgtagt 780 

agtacatcag cccccaaacc agtcttaact ctatcagaca acggggaact gaactttata 840 

ggaaatacag caattactag tggtggggcg atttatactg acaatctagt tctttcttct 900 

ggaggaccta cgctttttaa aaacaactct ggctatgata ctgcagctcc cttaggagga 960 

gcaattgcga ttgctgactc tggatctttg agtctttcgg ctcttggtgg agacatcact 1020 

tttgaaggaa acacagtagt caaaggagct tcttcgagtc agaccactac cagaaattct 1080 

attaacatcg gaaacaccaa tgctaagatt gtacagctgc gagcctctca aggcaatact 1140 

atctacttct atgatcctat aacaactagc atcactgcag ctctctcaga tgctctaaac 1200 

ttaaatggtc ctgaccttgc agggaatcct gcatatcaag gaaccatcgt attttctgga 1260 

gagaagctct cggaagcaga agctgcagaa gctgataatc tcaaatctac aattcagcaa 1320 

cctctaactc ttgcgggagg gcaactctct cttaaatcag gagtcactct agttgctaag 1380 

tccttttcgc aatctccggg ctctaccctc ctcatggatg cagggaccac attagaaacc 1440 

gctgatggga tcactatcaa taatcttgtt ctcaatgtag attccttaaa agagaccaag 1500 

aagggcacgc taaaagcaac acaagcaagt cagacagtca ctttatctgg atcgctctct 1560 

cttgtagatc cttctggaaa tgtctacgaa gatgtctctt ggaataaccc tcaagtcttt 1620 

tcttgtctca ctcttactgc tgacgacccc gcgaatattc acatcacaga cttagctgct 1680 

gatcccctag aaaaaaatcc tatccattgg ggataccaag ggaattgggc attatcttgg 1740 

caagaggata ctgcgactaa atccaaagca gcgactctta cctggacaaa aacaggatac 1800 

aatccgaatc ctgagcgtcg tggaacctta gttgctaaca cgctatgggg atcctttgtt 1860 

gatgtgcgct ccatacaaca gcttgtagcc actaaagtac gccaatctca agaaactcgc 1920 

ggcatctggt gtgaagggat ctcgaacttc ttccataaag atagcacgaa gataaataaa 1980 

ggttttcgcc acataagtgc aggttatgtt gtaggagcga ctacaacatt agcttctgat 2040 

aatcttatca ctgcagcctt ctgccaatta ttcgggaaag atagagatca ctttataaat 2100 

aaaaatagag cttctgccta tgcagcttct ctccatctcc agcatctagc gaccttgtct 2160 

tctccaagct tgttacgcta ccttcctgga tctgaaagtg agcagcctgt cctctttgat 2220 

gctcagatca gctatatcta tagtaaaaat actatgaaaa cctattacac ccaagcacca 2280 

aagggagaga gctcgtggta taatgacggt tgcgctctgg aacttgcgag ctccctacca 2340 

cacactgctt taagccatga gggtctcttc cacgcgtatt ttcctttcat caaagtagaa 2400 

gcttcgtaca tacaccaaga tagcttcaaa gaacgtaata ctaccttggt acgatctttc 24 60 

gatagcggtg atttaattaa cgtctctgtg cctattggaa ttaccttcga gagattctcg 2520 

agaaacgagc gtgcgtctta cgaagctact gtcatctacg ttgccgatgt ctatcgtaag 2580 

aatcctgact gcacgacagc tctcctaatc aacaatacct cgtggaaaac tacaggaacg 2640 

aatctctcaa gacaagctgg tatcggaaga gcagggatct tttatgcctt ctctccaaat 2700 

cttgaggtca caagtaacct atctatggaa attcgtggat cttcacgcag ctacaatgca 2760 

gatcttggag gtaagttcca gttc 2784 
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<210> 7 
<211> 3000 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) (2890) 
<400> 7 

gtacgaagtt cttcacgaaa ttataataca aacctaggct ctaagttttg tttctagatt 60 



atcgaaaacg tgttaattaa ttgaacccaa gcatctttct atg aaa ata ccc ttg 

Met Lys lie Pro Leu 
1 5 



115 



cac aaa etc ctg ate tct teg aet ctt. gtc act ccc att eta ttg age 
His Lys Leu Leu He Ser Ser Thr Leu Val Thr Pro lie Leu Leu Ser 
10 15 20 



163 



att gca aet tac gga gea gat get tct tta tee eet aea gat age ttt 
He Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser Pro Thr Asp Ser Phe 
25 30 35 



211 



gat gga gcg ggc ggc tct aea ttt act cca aaa tct aea gca gat gee 
Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys Ser Thr Ala Asp Ala 
40 45 50 



259 



aat gga acg aac tat gtc tta tea gga aat gtc tat ata aac gat get 
Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val Tyr He Asn Asp Ala 
55 60 65 



307 



ggg aaa ggc aca gca tta aea ggc tgc tge ttt aca gaa act acg ggt 
Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe Thr Glu Thr Thr Gly 
70 75 80 85 



355 



gat ctg aca ttt act gga aag gga tac tea ttt tea ttc aac acg gta 
Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe Ser Phe Asn Thr Val 
90 95 100 



403 



gat gcg ggt teg aat gca gga get gcg gca age aca act get gat aaa 
Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser Thr Thr Ala Asp Lys 
105 110 115 



451 



gee eta ate ttc aca gga ttt tct aac ctt tec ttc att gca get cct 
Ala Leu He Phe Thr Gly Phe Ser Asn Leu Ser Phe He Ala Ala Pro 
120 125 130 



499 



gga aet aca gtt get tea gga aaa agt act tta agt tct gca gga gee 
Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu Ser Ser Ala Gly Ala 
135 140 145 



547 



tta aat ctt acc gat aat gga acg att etc ttt age caa aac gtc tec 
Leu Asn Leu Thr Asp Asn Gly Thr He Leu Phe Ser Gin Asn Val Ser 
150 155 160 165 



595 
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aat gaa get aat aac aat ggc gga gcg ate aec aca aaa act ett tct 643 
Asn Glu Ala Asn Asn Asn Gly Gly Ala He Thr Thr Lys Thr Leu Ser 
170 175 180 

att tct ggg aat acc tct tct ata acc ttc act agt aat age gca aaa 691 
He Ser Gly Asn Thr Ser Ser He Thr Phe Thr Ser Asn Ser Ala Lys 
185 190 195 

aaa tta ggt gga gcg ate tat age tct gcg get gca agt att tea gga 739 
Lys Leu Gly Gly Ala He Tyr Ser Ser Ala Ala Ala Ser He Ser Gly 
200 205 210 

aac acc ggc cag tta gtc ttt atg aat aat aaa gga gaa act ggg ggt 787 
Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys Gly Glu Thr Gly Gly 
215 220 225 

ggg get etg ggc ttt gaa gee age tec teg att act caa aat age tec 835 
Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser He Thr Gin Asn Ser Ser 
230 235 240 245 

ett ttc ttc tct gga aac act gca aca gat get gca ggc aag ggc ggg 883 
Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala Ala Gly Lys Gly Gly 
250 255 260 

gee att tat tgt gaa aaa aca gga gag act cct act ett act ate tct 931 
Ala He Tyr Cys Glu Lys Thr Gly Glu Thr Pro Thr Leu Thr He Ser 
265 270 275 

gga aat aaa agt etg ace ttc gee gag aac tct tea gta act caa ggc 979 
Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser Ser Val Thr Gin Gly 
280 285 290 

gga gca ate tgt gee cat ggt eta gat ett tec get get ggc cct aec 1027 
Gly Ala He Cys Ala His Gly Leu Asp Leu Ser Ala Ala Gly Pro Thr 
295 300 305 

eta ttt tea aat aat aga tge ggg aac aca get gca ggc aag ggc ggc 1075 
Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala Ala Gly Lys Gly Gly 
310 315 320 325 

get att gca att gee gae tct gga tct tta agt etc tct gca aat caa 1123 
Ala He Ala He Ala Asp Ser Gly Ser Leu Ser Leu Ser Ala Asn Gin 
330 335 340 

gga gae ate acg ttc ett ggc aac act eta aec tea ace tec gcg cea 1171 
Gly Asp He Thr Phe Leu Gly Asn Thr Leu Thr Ser Thr Ser Ala Pro 
345 350 355 

aca teg aca egg aat get ate tac etg gga teg tea gca aaa att acg 1219 
Thr Ser Thr Arg Asn Ala He Tyr Leu Gly Ser Ser Ala Lys He Thr 
360 365 370 

aac tta agg gca gee caa ggc caa tct ate tat ttc tat gat ceg att 1267 
Asn Leu Arg Ala Ala Gin Gly Gin Ser He Tyr Phe Tyr Asp Pro He 
375 380 385 
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gca tct aac acc aca gga get tea gac gtt ctg acc ate aae caa ccg 1315 
Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu Thr lie Asn Gin Pro 
390 395 400 405 

gat age aac teg ect tta gat tat tea gga aeg att gta ttt tct ggg 1363 
Asp Ser Asn Ser Pro Leu Asp Tyr Ser Gly Thr lie Val Phe Ser Gly 
410 415 420 

gaa aag etc tct gca gat gaa gcg aaa get get gat aae tte aca tct 1411 
Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala Asp Asn Phe Thr Ser 
425 430 435 

ata tta aag caa cea ttg get eta gee tct gga acc tta gca etc aaa 1459 
lie Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly Thr Leu Ala Leu Lys 
440 445 . 450 

gga aat gtc gag tta gat gtc aat ggt ttc aca eag act gaa gge tct 1507 
Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr Gin Thr Glu Gly Ser 
455 460 465 

aca etc etc atg caa cea gga aea aag etc aaa gca gat act gaa get 1555 
Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys Ala Asp Thr Glu Ala 
470 475 480 485 

ate agt ett ace aaa ett gtc gtt gat ett tet gee tta gag gga aat 1603 
lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser Ala Leu Glu Gly Asn 
490 495 500 

aiag agt gtg tec att gaa aca gca gga gee aae aaa act ata act eta 1651 
Lys Ser Val Ser lie Glu Thr Ala Gly Ala Asn Lys Thr lie Thr Leu 
505 510 515 

acc tct cct ett gtt ttc caa gat agt age gge aat ttt tat gaa age 1699 
Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly Asn Phe Tyr Glu Ser 
520 525 530 

eat acg ata aac caa gcc tte acg eag ect ttg gtg gta ttc act get 1747 
His Thr lie Asn Gin Ala Phe Thr Gin Pro Leu Val Val Phe Thr Ala 
535 540 545 

get act get get age gat att tat ate gat gcg ett etc act tet cea 1795 
Ala Thr Ala Ala Ser Asp lie Tyr lie Asp Ala Leu Leu Thr Ser Pro 
550 555 560 565 

gta caa act cea gaa cet cat tae ggg tat cag gga eat tgg gaa gee 1843 
Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin Gly His Trp Glu Ala 
570 575 580 

act tgg gca gac aea tea act gca aaa tea gga act atg act tgg gta 1891 
Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly Thr Met Thr Trp Val 
585 590 595 

act acg gge tae aac cet aat cct gag cgt aga get tee gta gtt ccc 1939 
Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Ala Ser Val Val Pro 
600 605 610 
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gat tea tta tgg gca tec ttt act gac att cgc act eta cag cag ate 1987 
Asp Ser Leu Trp Ala Ser Phe Thr Asp lie Arg Thr Leu Gin Gin lie 
.615 620 625 

atg aca tct eaa gcg aat agt ate tat cag eaa ega gga etc tgg gca 2035 
Met Thr Ser Gin Ala Asn Ser He Tyr Gin Gin Arg Gly Leu Trp Ala 
630 635 640 645 

tea gga act gcg aat ttc ttc cat aag gat aaa tea gga act aac caa 2083 
Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys Ser Gly Thr Asn Gin 
650 655 660 

gca ttc cga cat aaa age tac ggc tat att gtt gga gga agt get gaa 2131 
Ala Phe Arg His Lys Ser Tyr Gly Tyr He Val Gly Gly Ser Ala Glu 
665 670 675 

gat ttt tct gaa aat ate ttc agt gta get ttc tgc cag etc ttc ggt 2179 
Asp Phe Ser Glu Asn He Phe Ser Val Ala Phe Cys Gin Leu Phe Gly 
680 685 690 

aaa gat aaa gac ctg ttt ata gtt gaa aat ace tct eat aae tat tta 2227 
Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr Ser His Asn Tyr Leu 
695 700 705 

gcg teg eta tac ctg eaa cat cga gca ttc eta gga gga ctt ccc atg 2275 
Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu Gly Gly Leu Pro Met 
710 715 720 725 

ccc tea ttt gga agt ate aec gac atg ctg aaa gat att ect etc att 2323 
Pro Ser Phe Gly Ser He Thr Asp Met Leu Lys Asp He Pro Leu He 
730 735 740 

ttg aat gcc cag eta age tac age tac act aaa aat gat atg gat act 2371 
Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys Asn Asp Met Asp Thr 
745 750 755 

cgc tat act tec tat ect gaa get caa ggc tct tgg ace aat aac tct 2419 
Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser Trp Thr Asn Asn Ser 
760 765 770 

ggg get eta gag etc gga gga tct ctg get eta tat etc ect aaa gaa 2467 
Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu Tyr Leu Pro Lys Glu 
775. 780 785 

gea ceg ttc ttc cag gga tat ttc ccc ttc tta aag ttc cag gca gtc 2515 
Ala Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu Lys Phe Gin Ala Val 
790 795 800 805 

tac age cgc caa caa aac ttt aaa gag agt ggc get gaa gee cgt get 2563 
Tyr Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly Ala Glu Ala Arg Ala 
810 815 820 

ttt gat gat gga gac eta gtg aae tgc tct ate ect gtc ggc att egg 2611 
Phe Asp Asp Gly Asp Leu Val Asn Cys Ser He Pro Val Gly He Arg 
825 830 835 
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tta gaa aaa ate tec gaa gat gaa aaa aat aat ttc gag att tct eta 2659 
Leu Glu Lys lie Ser Glu Asp Glu Lys Asn Asn Phe Glu lie Ser Leu 
840 845 850 

gee tac att ggt gat gtg tat cgt aaa aat cee cgt teg cgt aet tct 2707 
Ala Tyr lie Gly Asp Val Tyr Arg Lys Asn Pro Arg Ser Arg Thr Ser 
855 860 865 

eta atg gtc agt gga gcc tct tgg act teg eta tgt aaa aac etc gca 2755 
Leu Met Val Ser Gly Ala Ser Trp Thr Ser Leu Cys Lys Asn Leu Ala 
870 875 880 885 

cga caa gcc ttc tta gca agt get gga age cat etg aet etc tec ect 2803 
Arg Gin Ala Phe Leu Ala Ser Ala Gly Ser His Leu Thr Leu Ser Pro 
890 895 900 

cat gta gaa etc tct ggg gaa get get tat gag ett cgt ggc tea gca 2851 
His Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu Leu Arg Gly Ser Ala 
905 910 915 

eac ate tac aat gta gat tgt ggg eta aga tac tea ttc tagttcctac 2900 
His lie Tyr Asn Val Asp Cys Gly Leu hrg Tyr Ser Phe 
920 925 930 

tttectccct aaacttttag ggaggaattc ttataaaaac cetgtagatt ettaacttae 2960 

tagtetctcc tttcctettg ctttetttaa tttattgeag 3000 



<210>8 

<211>2790 

<212>DNA 

<2 1 3>Chla2nydiapneiunoniae 

<220> 

<221>CDS 

<222>(1) (2790) 

<400>8 

atgaaaatae ccttgcacaa actcctgatc tcttcgaete ttgtcactec cattetattg 60 

agcattgcaa ettaeggage agatgcttet ttatceecta eagatagett tgatggageg 120 

ggeggctcta eatttactcc aaaatetaca gcagatgeca atggaaegaa ctatgtetta 180 

tcaggaaatg tetatataaa cgatgctggg aaaggcacag cattaacagg ctgetgcttt 240 

acagaaacta cgggtgatct gacatttact ggaaagggat acteatttte attcaacacg 300 

gtagatgegg gttcgaatge aggagctgeg geaagcacaa etgctgataa ageeetaatc 360 

tteacaggat tttctaacct tteettcatt geagetectg gaactacagt tgettcagga 420 

aaaagtaett taagttctgc aggagectta aatcttaeeg ataatggaac gattetcttt 480 

agccaaaacg tetceaatga agctaataac aatggcggag cgatcaeeae aaaaactett 540 

tetatttetg ggaataccte ttctataace tteaetagta atagegcaaa aaaattaggt 600 

ggagcgatct atagetetgc ggetgeaagt atttcaggaa acaceggeea gttagtcttt 660 

atgaataata aaggagaaac tgggggtggg getctggget ttgaagecag etcctcgatt 720 

actcaaaata getccctttt cttctetgga aacactgeaa cagatgetge aggcaaggge 780 

ggggccattt attgtgaaaa aacaggagag actcctactc ttactatctc tggaaataaa 840 

agtctgacct tcgccgagaa etcttcagta actcaaggcg gagcaatctg tgcccatggt 900 

ctagatcttt ccgctgctgg cectaeccta ttttcaaata atagatgcgg gaacacagct 960 

gcaggcaagg geggcgctat tgeaattgee gactctggat ctttaagtct etetgcaaat 1020 
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caaggagaca tcacgttcct tggcaacact ctaacctcaa cctccgcgcc aacatcgaca 1080 

cggaatgcta tctacctggg atcgtcagca aaaattacga acttaagggc agcccaaggc 1140 

caatctatct atttctatga tccgattgca tctaacacca caggagcttc agacgttctg 1200 

accatcaacc aaccggatag caactcgcct ttagattatt caggaacgat tgtattttct 1260 

ggggaaaagc tctctgcaga tgaagcgaaa gctgctgata acttcacatc tatattaaag 1320 

caaccattgg ctctagcctc tggaacctta gcactcaaag gaaatgtcga gttagatgtc 1380 

aatggtttca cacagactga aggctctaca ctcctcatgc aaccaggaac aaagctcaaa 1440 

gcagatactg aagctatcag tcttaccaaa cttgtcgttg atctttctgc cttagaggga 1500 

aataagagtg tgtccattga aacagcagga gccaacaaaa ctataactct aacctctcct 1560 

cttgttttcc aagatagtag cggcaatttt tatgaaagcc atacgataaa ccaagccttc 1620 

acgcagcctt tggtggtatt actgctgcta ctgctgctag cgatatttat atcgatgcgc 1680 

ttctcacttc tccagtacaa actccagaac ctcattacgg gtarcaggga cattgggaag 1740 

ccacttgggc agacacatca actgcaaaat caggaactat gacttgggta actacgggct 1800 

acaaccctaa tcctgagcgt agagcttccg tagttcccga ttcattatgg gcatccttta 1860 

ctgacattcg cactctacag cagatcatga catctcaagc gaatagtatc tatcagcaac 1920 

gaggactctg ggcatcagga actgcgaatt tcttccataa ggataaatca ggaactaacc 1980 

aagcattccg acataaaagc tacggctata ttgttggagg aagtgctgaa gatttttctg 2040 

aaaatatctt cagtgtagct ttctgccagc tcttcggtaa agataaagac ctgtttatag 2100 

ttgaaaatac ctctcataac tatttagcgt cgctatacct gcaacatcga gcattcctag 2160 

gaggacttcc catgccctca tttggaagta tcaccgacat gctgaaagat attcctctca 2220 

ttttgaatgc ccagctaagc tacagctaca ctaaaaatga tatggatact cgctatactt 2280 

cctatcctga agctcaaggc tcttggacca ataactctgg ggctctagag ctcggaggat 2340 

ctcliggctct atatctccct aaagaagcac cgttcttcca gggatatttc cccttcttaa 2400 

agttccaggc agtctacagc cgccaacaaa actttaaaga gagtggcgct gaagcccgtg 2460 

cttttgatga tggagaccta gtgaactgct ctatccctgt cggcattcgg ttagaaaaaa 2520 

tctccgaaga tgaaaaaaat aatttcgaga tttctctagc ctacattggt gatgtgtatc 2580 

gtaaaaatcc ccgttcgcgt acttctctaa tggtctcagt ggagcctctt ggacttcgct 2640 

tagtaaaaac ctcgcacgac aagccttctt agcaagtgct ggaagccatc tgactctctc 2700 

cctcatgtag aactctctgg ggaagctgct tatgagcttc gtggctcagc acacatctac 2760 

aatgtagatt gtgggctaag atactcattc 2790 



<210> 9 
<211> 1100 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) . . (979) 

<400> 9 

ataaagtttt ttatatgaac aaaactttga cttcattggc agacacttcg accttaacag 60 

accaattttg ttgtcatccc tataaaaatc aggaattttc atg etc tec tea eta 115 

Met Leu Ser Ser Leu 
1 5 

ate cgt gat tea ttt cce ett ctt att tta ett ccc aca tte eta gcg 163 
lie Arg Asp Ser Phe Pro Leu Leu He Leu Leu Pro Thr Phe Leu Ala 
10 15 20 



gca tta gga gee tee gta get gge ggc gtt atg gga acc tat ate gtt 
Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met Gly Thr Tyr He Val 
25 30 35 



211 
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wo 00/32794 



PCT/CA99/D1147 



gta aaa cgt att gtt tea att agt gga ag:t ata tct cat gca att eta 259 
Val Lys Arg lie Val Ser lie Ser Gly Ser He Ser His Ala He Leu 
40 45 50 

gga gga att ggc etc acc eta tgg ata caa tat aag ett cat etc tct 307 
Gly Gly He Gly Leu Thr Leu Trp lie Gin Tyr Lys Leu His Leu Ser 
55 60 65 

ttt ttc cct atg tat gga get att gta gga get att ttt eta get ett 355 
Phe Phe Pro Met Tyr Gly Ala He Val Gly Ala He Phe Leu Ala Leu 
70 75 80 85 

tgc ate ggc aag ate cae etg aaa tac caa gaa agg gaa gac tct ttg 403 
Cys He Gly Lys He His Leu Lys Tyr Gin Glu Arg Glu Asp Ser Leu 
90 95 100 

att geg atg att tgg tct gtg ggc atg gca att gga att ata ttc att 451 
He Ala Met lie Trp Ser Val Gly Met Ala He Gly He He Phe He 
105 110 115 

tec agg ett ecc acc ttt a at gga gag etc ate aat ttt eta ttt ggg 499 
Ser TUrg Leu Pro Thr Phe Asn Gly Glu Leu He Asn Phe Leu Phe Gly 
120 125 130 

aac att etc tgg gtc acc cct tea gac etc tat age tta gga ate ttt 547 
Asn He Leu Trp Val Thr Pro Ser Asp Leu Tyr Ser Leu Gly He Phe 
135 140 145 

gat ett ett gtt tta gga att gtg gtc ett tgc cae acc egg ttc ett 595 
Asp Leu Leu Val Leu Gly He Val Val Leu Cys His Thr Arg Phe Leu 
150 155 160 165 

get ett tgc ttt gat gag agg tac aeg get tta aac cat tgt tct gta 643 
Ala Leu Cys Phe Asp Glu Arg Tyr Thr Ala Leu Asn His Cys Ser Val 
170 175 180 

cag etg tgg tat ttc eta ett ett gtt etg aea gca ate aeg att gtg 691 
Gin Leu Trp Tyr Phe Leu Leu Leu Val Leu Thr Ala He Thr He Val 
185 190 195 

atg ttg att tat gtg atg gga aeg att etg atg ett age atg etc gtc 739 
Met Leu He Tyr Val Met Gly Thr He Leu Met Leu Ser Met Leu Val 
200 205 210 

tta cct gtt get ata geg tgt aga ttt teg tac aag atg aea cga att 787 
Leu Pro Val Ala He Ala Cys Arg Phe Ser Tyr Lys Met Thr Arg He 
215 220 225 

atg ttc ate teg gtc etc ttg aat ate tta tgt tct ttt tct gga att 835 
Met Phe He Ser Val Leu Leu Asn He Leu Cys Ser Phe Ser Gly He 
230 235 240 245 

tgc ate gee tac tgt tta gat ttc cca gta ggt cct aeg ata tea ttg 883 
Cys He Ala Tyr Cys Leu Asp Phe Pro Val Gly Pro Thr He Ser Leu 
250 255 260 
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ctg atg ggg tta ggt tat aca gcg agt ctt tgt gtg aag aag egg tac 931 
Leu Met Gly Leu Gly Tyr Thr Ala Ser Leu Cys Val Lys Lys Arg Tyr 
265 270 275 

aat ccg teg acg cct tct cct gta agt cct gaa ate aat aca aat gta 979 
Asn Pro Ser Thr Pro Ser Pro Val Ser Pro Glu lie Asn Thr Asn Val 
280 285 290 

tagctaggga agegcttttg gaagctttgg aggeattctt cctgttcgtc aggaagaaga 1039 

tcatcaattt tatttaaagc taccagcata tctttctttt caaaatctgg ctgatgagag 1099 

t 1100 



<210> 10 
<211> 880 
<212> DNA 

<213> Chlamydiapneumoniae 

<220> 

<221> CDS 

<222> (1) (880) 

<400> 10 

atgctctcct cactaatccg tgattcattt cecettctta ttttacttcc cacattceta 60 
gcggcattag gagcctccgt agctggcggc gttatgggaa ectatategt tgtaaaacgt 120 
attgtttcaa ttagtggaag tatatctcat gcaattctag gaggaattgg cctcacccta 180 
tggatacaat ataagcttca tctctctttt ttccctatgt atggagctat tgtaggagct 240 
atttttctag ctctttgcat cggcaagatc cacctgaaat accaagaaag ggaagactct 300 
ttgattgcga tgatttggte tgtgggcatg gcaattggaa ttatatteat' ttecaggctt 360 
cccaccttta atggagagct catcaatttt ctatttggga acattctctg ggtcacccct 420 
tcagacctct atagcttagg aatctttgat cttcttgttt taggaattgt ggtcetttge 480 
cacacccggt tccttgctct ttgctttgat gagaggtaca cggctttaaa ccattgttct 540 
gtacagctgt ggtatttcct acttcttgtt ctgacagcaa teacgattgt gatgttgatt 600 
tatgtgatgg gaacgattct gatgcttagc atgctcgtct tacctgttgc tatagcgtgt 660 
agattttcgt acaagatgac acgaattatg ttcatctcgg tcctcttgaa tatettatgt 720 
tctttttctg gaatttgcat egcetactgt ttagatttec cagtaggtcc tacgatatca 780 
ttgctgatgg ggttaggtta tacagcgagt ctttgtgtag aagaagcggt acaatccgte 840 
gacgccttct eetgtaagtc ctgaaateaa tacaaatgta 880 



<210> 11 
<211> 928 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 11 

Met Lys Thr Ser He Pro Trp Val Leu Val Ser Ser Val Leu Ala Phe 
1 5 10 15 

Ser Cys His Leu Gin Ser Leu Ala Asn Glu Glu Leu Leu Ser Pro Asp 
20 25 30 
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Asp Ser Phe Asn Gly Asn He Asp Ser Gly Thr Phe. Thr Pro Lys Thr 
35 40 45 



Ser Ala Thr Thr Tyr Ser Leu Thr 
50 55 

Gly Lys Gly Thr Pro Leu Ser Asp 
65 70 

Asn Leu Thr Phe Leu Gly Asn Gly 
85 

Asp Ala Gly Thr His Ala Gly Ala 
100 



Gly Asp Val Phe Phe Tyr Glu Pro 
60 

Ser Cys Phe Lys Gin Thr Thr Asp 
75 80 

His Ser Leu Thr Phe Gly Phe He 
90 95 

Ala Ala Ser Thr Thr Ala Asn Lys 
105 110 



Asn Leu Thr Phe Ser Gly Phe Ser 
115 120 

Ser Thr Thr Val Thr Thr Gly Gin 
130 135 

Val Asn Leu Glu Asn He Arg Lys 
145 150 

Thr Ala Asp Gly Gly Ala He Lys 
165 



Leu Leu Ser Phe Asp Ser Ser Pro 
125 

Gly Thr Leu Ser Ser Ala Gly Gly 
140 

Leu Val Val Ala Gly Asn Phe Ser 
155 160 

Gly Ala Ser Phe Leu Leu Thr Gly 
170 175 



Thr Ser Gly Asp Ala Leu Phe Ser Asn Asn Ser Ser Ser Thr Lys Gly 
180 185 190 

Gly Ala He Ala Thr Thr Ala Gly Ala Arg He Ala Asn Asn Thr Gly 
195 200 205 

Tyr Val Arg Phe Leu Ser Asn He Ala Ser Thr Ser Gly Gly Ala He 
210 215 220 

Asp Asp Glu Gly Thr Ser He Leu Ser Asn Asn Lys Phe Leu Tyr Phe 
225 230 235 240 

Glu Gly Asn Ala Ala Lys Thr Thr Gly Gly Ala He Cys Asn Thr Lys 
245 250 255 

Ala Ser Gly Ser Pro Glu Leu He He Ser Asn Asn Lys Thr Leu He 
260 265 270 

Phe Ala Ser Asn Val Ala Glu Thr Ser Gly Gly Ala He His Ala Lys 
275 280 285 

Lys Leu Ala Leu Ser Ser Gly Gly Phe Thr Glu Phe Leu Arg Asn Asn 
290 295 300 

Val Ser Ser Ala Thr Pro Lys Gly Gly Ala He Ser He Asp Ala Ser 
305 310 315 320 

Gly Glu Leu Ser Leu Ser Ala Glu Thr Gly Asn lie Thr Phe Val Arg 
325 330 335 
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Asn Thr Leu Thr Thr Thr Gly Ser 
340 

lie Asn lie Gly Ser Asn Gly Lys 
355 360 

TVsn His Thr lie Phe Phe Tyr Asp 
370 375 

Ser Asp Val Leu Lys He Asn Asn 
385 390 



Thr Asp Thr Pro Lys Arg Asn Ala 
345 350 

Phe Thr Glu Leu Arg Ala Ala Lys 
365 

Pro He Thr Ser Glu Gly Thr Ser 
380 

Gly Ser Ala Gly Ala Leu Asn Pro 
395 400 



Tyr Gin Gly Thr He Leu Phe Ser Gly Glu Thr Leu Thr Ala Asp Glu 
405 410 415 

Leu Lys Val Ala Asp Asn Leu Lys Ser. Ser Phe Thr Gin Pro Val Ser 
420 425 430 

Leu Ser Gly Gly Lys Leu Leu Leu Gin Lys Gly Val Thr Leu Glu Ser 
435 440 445 

Thr Ser Phe Ser Gin Glu Ala Gly Ser Leu Leu Gly Met Asp Ser Gly 
450 455 460 

Thr Thr Leu Ser Thr Thr Ala Gly Ser He Thr He Thr Asn Leu Gly 
465 470 475 480 

He Asn Val Asp Ser Leu Gly Leu Lys Gin Pro Val Ser Leu Thr Ala 
485 490 495 

Lys Gly Ala Ser Asn Lys Val He Val Ser Gly Lys Leu Asn Leu He 
500 505 510 

Asp He Glu Gly Asn He Tyr Glu Ser His Met Phe Ser His Asp Gin 
515 520 525 

Leu Phe Ser Leu Leu Lys He Thr Val Asp Ala Asp Val Asp Thr Asn 
530 535 540 

Val Asp He Ser Ser Leu He Pro Val Pro Ala Glu Asp Pro Asn Ser 
545 550 555 560 

Glu Tyr Gly Phe Gin Gly Gin Trp Asn Val Asn Trp Thr Thr Asp Thr 
565 570 575 

Ala Thr Asn Thr Lys Glu Ala Thr Ala Thr Trp Thr Lys Thr Gly Phe 
580 585 590 

Val Pro Ser Pro Glu Arg Lys Ser Ala Leu Val Cys Asn Thr Leu Trp 
595 600 605 

Gly Val Phe Thr Asp He Arg Ser Leu Gin Gin Leu Val Glu He Gly 
610 615 620 

Ala Thr Gly Met Glu His Lys Gin Gly Phe Trp Val Ser Ser Met Thr 
625 630 635 640 
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Asn Phe Leu His Lys Thr Gly Asp Glu Asn Arg Lys Gly Phe Arg His 
645 650 655 

Thr Ser Gly Gly Tyr Val lie Gly Gly Ser Ala His Thr Pro Lys Asp 
660 665 670 

Asp Leu Phe Thr Phe Ala Phe Cys His Leu Phe Ala Arg Asp Lys Asp 
675 680 685 

Cys Phe lie Ala His Asn Asn Ser Arg Thr Tyr Gly Gly Thr Leu Phe 
690 695 700 

Phe Lys His Ser His Thr Leu Gin Pro Gin Asn Tyr Leu Arg Leu Gly 
705 710 715 720 

Arg Ala Lys Phe Ser Glu Ser Ala lie Glu Lys Phe Pro Arg Glu lie 
725 730 735 

Pro Leu Ala Leu Asp Val Gin Val Ser Phe Ser His Ser Asp Asn Arg 
740 745 750 

Met Glu Thr His Tyr Thr Ser Leu Pro Glu Ser Glu Gly Ser Trp Ser 
755 760 765 

Asn Glu Cys lie Ala Gly Gly lie Gly Leu Asp Leu Pro Phe Val Leu 
770 775 780 

Ser Asn Pro His Pro Leu Phe Lys Thr Phe lie Pro Gin Met Lys Val 
785 790 795 800 

Glu Met Val Tyr Val Ser Gin Asn Ser Phe Phe Glu Ser Ser Ser Asp 
805 810 815 

Gly Arg Gly Phe Ser lie Gly Arg Leu Leu Asn Leu Ser lie Pro Val 
820 825 830 

Gly Ala Lys Phe Val Gin Gly Asp He Gly Asp Ser Tyr Thr Tyr Asp 
835 840 845 

Leu Ser Gly Phe Phe Val Ser Asp Val Tyr Arg Asn Asn Pro Gin Ser 
850 855 860 

Thr Ala Thr Leu Val Met Ser Pro Asp Ser Trp Lys He Arg Gly Gly 
865 870 875 880 

Asn Leu Ser Arg Gin Ala Phe Leu Leu Arg Gly Ser Asn Asn Tyr Val 
885 890 895 

Tyr Asn Ser Asn Cys Glu Leu Phe Gly His Tyr Ala Met Glu Leu Arg 
900 905 910 



Gly Ser Ser Arg Asn Tyr Asn Val Asp Val Gly Thr Lys Leu Arg Phe 
915 920 925 
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<210> 13 
<211> 885 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 13 . 

Gly Thr Tyr Thr Pro Lys Asn Thr Thr Thr Gly lie Asp Tyr Thr Leu 
1 5 10 15 

Thr Gly Asp lie Thr Leu Gin Asn Leu Gly Asp Ser Ala Ala Leu Thr 
20 25 30 

Lys Gly Cys Phe Ser Asp Thr Thr Glu Ser Leu Ser Phe Ala Gly Lys 
35 40 45 

Gly Tyr Ser Leu Ser Phe Leu Asn lie. Lys Ser Ser Ala Glu Gly Ala 
50 55 60 

Ala Leu Ser Val Thr Thr Asp Lys Asn Leu Ser Leu Thr Gly Phe Ser 
65 70 75 80 

Ser Leu Thr Phe Leu Ala Ala Pro Ser Ser Val lie Thr Thr Pro Ser 
85 90 95 

Gly Lys Gly Ala Val Lys Cys Gly Gly Asp Leu Thr Phe Asp Asn Asn 
100 105 110 

Gly Thr lie Leu Phe Lys Gin Asp Tyr Cys Glu Glu Asn Gly Gly Ala 
115 120 125 

lie Ser Thr Lys Asn Leu Ser Leu Lys Asn Ser Thr Gly Ser He Ser 
130 135 140 

Phe Glu Gly Asn Lys Ser Ser Ala Thr Gly Lys Lys Gly Gly Ala He 
145 150 155 160 

Cys Ala Thr Gly Thr Val Asp He Thr Asn Asn Thr Ala Pro Thr Leu 
165 170 175 

Phe Ser Asn Asn He Ala Glu hla Ala Gly Gly Ala He Asn Ser Thr 
180 185 190 

Gly Asn Cys Thr He Thr Gly Asn Thr Ser Leu Val Phe Ser Glu Asn 
195 200 205 

Ser Val Thr Ala Thr Ala Gly Asn Gly Gly Ala Leu Ser Gly Asp Ala 
210 215 220 

Asp Val Thr He Ser Gly Asn Gin Ser Val Thr Phe Ser Gly Asn Gin 
225 230 235 240 

Ala Val Ala Asn Gly Gly Ala He Tyr Ala Lys Lys Leu Thr Leu Ala 
245 250 255 

Ser Gly Gly Gly Gly Gly Asn Pro Phe Ser Asn Asn lie Val Gin Gly 
260 265 270 
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Thr Thr Ala Gly Asn Giy Gly Ala He Ser He Leu Ala Ala Gly Glu 
275 280 285 

Cys Ser Leu Phe Ser Glu Ala Gly Asp His Tyr Leu Asn Gly Asn Ala 
290 295 300 

He Val Ala Thr Thr Pro Gin Thr Thr Lys Arg Asn Ser He Asp He 
305 310 315 320 

Gly Ser Thr Gly Lys Asp His Glu Leu Arg Ala He Ser Gly His Ser 
325 330 335 

He Phe Phe Tyr Asp Pro He Thr Ala Asn Thr Ala Ala Asp Ser Thr 
340 345 350 

Asp Thr Leu Asn Leu Asn Lys Ala Asp Ala Gly Asn Ser Thr Asp Tyr 
355 360 ^ 365 

Ser Gly Ser He Val Phe Ser Gly Glu Lys Leu Ser Glu Asp Glu Ala 
370 375 380 

Lys Val Ala Asp Asn Leu Thr Ser Thr Leu Lys Gin Pro Val Thr Leu 
385 390 395 400 

Thr Ala Gly Asn Leu Val Leu Lys Arg Gly Val Thr Leu Asp Thr Lys 
405 410 415 

Gly Phe Thr Gin Thr Ala Gly Ser Ser Val He Met Asp Ala Gly Thr 
420 425 430 

Thr Leu Lys Ala Ser Thr Glu Glu Val Thr Leu Thr Gly Leu Ser He 
435 440 445 

Pro Val Asp Ser Leu Gly Glu Gly Lys Lys Val Val He Ala Ala Ser 
450 455 460 

Ala Ala Ser Lys Asn Val Ala Leu Ser Gly Pro He Leu Leu Leu Asp 
465 470 475 480 

Asn Gin Gly Asn Ala Tyr Glu Asn His Asp Leu Gly Lys Thr Gin Asp 
485 490 495 

Phe Ser Phe Val Gin Leu Ser Ala Leu Gly Thr Ala Thr Thr Thr Asp 
500 505 510 

Val Pro Ala Val Pro Thr Val Ala Thr Pro Thr His Tyr Gly Tyr Gin 
515 520 525 

Gly Thr Trp Gly Met Thr Trp Val Asp Asp Thr Ala Ser Thr Pro Lys 
530 535 540 

Thr Lys Thr Ala Thr Leu Ala Trp Thr Asn Thr Gly Tyr Leu Pro Asn 
545 550 555 560 



Pro Glu Arg Gin Gly Pro Leu Val Pro Asn Ser Leu Trp Gly Ser Phe 
565 570 575 
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Ser Asp lie Gin Ala lie Gin Gly Val lie Glu Arg Ser Ala Leu Thr 
580 585 590 

Leu Cys Ser Asp Arg Gly Phe Trp Ala Ala Gly Val Ala Asn Phe Leu 
595 600 605 

Asp Lys Asp Lys Lys Gly Glu Lys Arg Lys Tyr Arg His Lys Ser Gly 
610 615 620 

Gly Tyr Ala lie Gly Gly Ala Ala Gin Thr Cys Ser Glu Asn Leu lie 
625 630 635 640 

Ser Phe Ala Phe Cys Gin Leu Phe Gly Ser Asp Lys Asp Phe Leu Val 
645 650 655 

Ala Lys Asn His Thr Asp Thr Tyr Ala Gly Ala Phe Tyr He Gin His 
660 665 670 

He Thr Glu Cys Ser Gly Phe He Gly Cys Leu Leu Asp Lys Leu Pro 
675 680 685 

Gly Ser Trp Ser His Lys Pro Leu Val Leu Glu Gly Gin Leu Ala Tyr 
690 695 700 

Ser His Val Ser Asn Asp Leu Lys Thr Lys Tyr Thr Ala Tyr Pro Glu 
705 710 715 720 

Val Lys Gly Ser Trp Gly Asn Asn Ala Phe Asn Met Met Leu Gly Ala 
725 730 735 

Ser Ser His Ser Tyr Pro Glu Tyr Leu His Cys Phe Asp Thr Tyr Ala 
740 745 750 

Pro Tyr He Lys Leu Asn Leu Thr Tyr He Arg Gin Asp Ser Phe Ser 
755 760 765 

Glu Lys Gly Thr Glu Gly Arg Ser Phe Asp Asp Ser Asn Leu Phe Asn 
770 775 780 

Leu Ser Leu Pro He Gly Val Lys Phe Glu Lys Phe Ser Asp Cys Asn 
785 790 795 800 

Asp Phe Ser Tyr Asp Leu Thr Leu Ser Tyr Val Pro Asp Leu He Arg 
805 810 815 

Asn Asp Pro Lys Cys Thr Thr Ala Leu Val He Ser Gly Ala Ser Trp 
820 825 830 



Glu Thr Tyr Ala Asn Asn Leu Ala 
835 840 

Gly Ser His Tyr Ala Phe Ser Pro 
850 855 

Val Phe Glu Val Arg Gly Ser Ser 
865 870 



Arg Gin Ala Leu Gin Val Arg Ala 
845 

Met Phe Glu Val Leu Gly Gin Phe 
860 

Arg He Tyr Asn Val Asp Leu Gly 
875 880 
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Gly Lys Phe Gin Phe 
885 



<210> 14 
<211> 928 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 14 

Met Lys Ser Ser Leu His Trp Phe Leu lie Ser Ser Ser Leu Ala Leu 
1 5 10 15 

Pro Leu Ser Leu Asn Phe Ser Ala Phe Ala Ala Val Val Glu lie Asn 

20 25 30 

Leu Gly Pro Thr Asn Ser Phe Ser Gly Pro Gly Thr Tyr Thr Pro Pro 
35 40 45 

Ala Gin Thr Thr Asn Ala Asp Gly Thr lie Tyr Asn Leu Thr Gly Asp 
50 55 60 

Val Ser lie Thr Asn Ala Gly Ser Pro Thr Ala Leu Thr Ala Ser Cys 
65 70 ' 75 80 

Phe Lys Glu Thr Thr Gly Asn Leu Ser Phe Gin Gly His Gly Tyr Gin 
85 90 95 

Phe Leu Leu Gin Asn lie Asp Ala Gly Ala Asn Cys Thr Phe Thr Asn 
100 105 110 

Thr Ala Ala Asn Lys Leu Leu Ser Phe Ser Gly Phe Ser Tyr Leu Ser 
115 120 125 

Leu lie Gin Thr Thr Asn Ala Thr Thr Gly Thr Gly Ala lie Lys Ser 
130 135 140 

Thr Gly Ala Cys Ser lie Gin Ser Asn Tyr Ser Cys Tyr Phe Gly Gin 
145 150 155 160 

Asn Phe Ser Asn Asp Asn Gly Gly Ala^Leu Gin Gly Ser Ser lie Ser 
165 170 175 

Leu Ser Leu Asn Pro Asn Leu Thr Phe Ala Lys Asn Lys Ala Thr Gin 
180 185 190 

Lys Gly Gly Ala Leu Tyr Ser Thr Gly Gly lie Thr lie Asn Asn Thr 
195 200 ' 205 

Leu Asn Ser Ala Ser Phe Ser Glu Asn Thr Ala Ala Asn Asn Gly Gly 
210 215 220 

Ala lie Tyr Thr Glu Ala Ser Ser Phe lie Ser Ser Asn Lys Ala lie 
225 230 235 240 
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Ser Phe lie Asn Asn Ser Val Thr Ala Thr Ser Ala Thr Gly Gly Ala 
245 250 255 

lie Tyr Cys Ser Ser Thr Ser Ala Pro Lys Pro Val Leu Thr Leu Ser 
260 265 270 

Asp Asn Gly Glu Leu Asn Phe lie Gly Asn Thr Ala lie Thr Ser Gly 
275 280 285 

Gly Ala lie Tyr Thr Asp Asn Leu Val Leu Ser Ser Gly Gly Pro Thr 
290 295 300 

Leu Phe Lys Asn Asn Ser Gly Tyr Asp Thr Ala Ala Pro Leu Gly Gly 
305 310 315 320 

Ala lie Ala lie Ala Asp Ser Gly Ser. Leu Ser Leu Ser Ala Leu Gly 
325 330 335 

Gly Asp lie Thr Phe Glu Gly Asn Thr Val Val Lys Gly Ala Ser Ser 
340 345 350 

Ser Gin Thr Thr Thr Arg Asn Ser lie Asn lie Gly Asn Thr Asn Ala 
355 360 365 

Lys lie Val Gin Leu Arg Ala Ser Gin Gly Asn Thr lie Tyr Phe Tyr 
370 375 380 

Asp Pro lie Thr Thr Ser lie Thr Ala Ala Leu Ser Asp Ala Leu Asn 
385 390 ' 395 400 

Leu Asn Gly Pro Asp Leu Ala Gly Asn Pro Ala Tyr Gin Gly Thr lie 
405 410 415 

Val Phe Ser Gly Glu Lys Leu Ser Glu Ala Glu Ala Ala Glu Ala Asp 
420 425 430 

Asn Leu Lys Ser Thr lie Gin Gin Pro Leu Thr Leu Ala Gly Gly Gin 
435 440 445 

Leu Ser Leu Lys Ser Gly Val Thr Leu Val Ala Lys Ser Phe Ser Gin 
450 455 460 

Ser Pro Gly Ser Thr Leu Leu Met Asp Ala Gly Thr Thr Leu Glu Thr 
465 470 475 480 

Ala Asp Gly lie Thr lie Asn Asn Leu Val Leu Asn Val Asp Ser Leu 
485 490 495 

Lys Glu Thr Lys Lys Gly Thr Leu Lys Ala Thr Gin Ala Ser Gin Thr 
500 505 510 

Val Thr Leu Ser Gly Ser Leu Ser Leu Val Asp Pro Ser Gly Asn Val 
515 520 525 

Tyr Glu Asp Val Ser Trp Asn Asn Pro Gin Val Phe Ser Cys Leu Thr 
530 535 540 
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Leu Thr Ala Asp Asp Pro Ala Asn lie His lie Thr Asp Leu Ala Ala 
545 550 555 560 

Asp Pro Leu Glu Lys Asn Pro lie His Trp Gly Tyr Gin Gly Asn Trp 
565 570 575 

Ala Leu Ser Trp Gin Glu Asp Thr Ala Thr Lys Ser Lys Ala Ala Thr 
580 585 590 

Leu Thr Trp Thr Lys Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg Gly 
595 600 605 

Thr Leu Val Ala Asn Thr Leu Trp Gly Ser Phe Val Asp Val Arg Ser 
610 615 620 

lie Gin Gin Leu Val Ala Thr Lys Val Arg Gin Ser Gin Glu Thr Arg 
625 630 635 640 

Gly lie Trp Cys Glu Gly lie Ser Asn Phe Phe His Lys Asp Ser Thr 
645 650 655 

Lys lie Asn Lys Gly Phe Arg His lie Ser Ala Gly Tyr Val Val Gly 
660 665 670 

Ala Thr Thr Thr Leu Ala Ser Asp Asn Leu He Thr Ala Ala Phe Cys 
675 680 685 

Gin Leu Phe Gly Lys Asp Arg Asp His Phe He Asn Lys Asn Arg Ala 
690 695 700 

Ser Ala Tyr Ala Ala Ser Leu His Leu Gin His Leu Ala Thr Leu Ser 
705 710 715 720 

Ser Pro Ser Leu Leu Arg Tyr Leu Pro Gly Ser Glu Ser Glu Gin Pro 
725 730 735 

Val Leu Phe Asp Ala Gin He Ser Tyr He Tyr Ser Lys Asn Thr Met 
740 745 750 

Lys Thr Tyr Tyr Thr Gin Ala Pro Lys Gly Glu Ser Ser Trp Tyr Asn 
755 760 765 

Asp Gly Cys Ala Leu Glu Leu Ala Ser Ser Leu Pro His Thr Ala Leu 
770 775 780 

Ser His Glu Gly Leu Phe His Ala Tyr Phe Pro Phe He Lys Val Glu 
785 790 795 800 

Ala Ser Tyr He His Gin Asp Ser Phe Lys Glu Arg Asn Thr Thr Leu 
805 810 815 

Val Arg Ser Phe Asp Ser Gly Asp Leu He Asn Val Ser Val Pro He 
820 825 830 



Gly He Thr Phe Glu Arg Phe Ser Arg Asn Glu Arg Ala Ser Tyr Glu 
835 840 845 
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Ala Thr Val lie Tyr Val Ala Asp Val Tyr Arg Lys Asn Pro Asp Cys 
850 855 860 

Thr Thr Ala Leu Leu lie Asn Asn Thr Ser Trp Lys Thr Thr Gly Thr 
865 870 875 880 

Asn Leu Ser Arg Gin Ala Gly He Gly Arg Ala Gly He Phe Tyr Ala 
885 890 895 

Phe Ser Pro Asn Leu Glu Val Thr Ser Asn Leu Ser Met Glu He Arg 
900 905 910 

Gly Ser Ser Arg Ser Tyr Asn Ala Asp Leu Gly Gly Lys Phe Gin Phe 
915 920 925 



<210> 15 
<211> 930 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 15. 

Met Lys He Pro Leu His Lys Leu Leu He Ser Ser Thr Leu Val Thr 
1 5 10 15 

Pro He Leu Leu Ser He Ala Thr Tyr Gly Ala Asp Ala Ser Leu Ser 
20 25 30 

Pro Thr Asp Ser Phe Asp Gly Ala Gly Gly Ser Thr Phe Thr Pro Lys 
35 40 45 

Ser Thr Ala Asp Ala Asn Gly Thr Asn Tyr Val Leu Ser Gly Asn Val 
50 55 60 

Tyr He Asn Asp Ala Gly Lys Gly Thr Ala Leu Thr Gly Cys Cys Phe 
65 70 75 80 

Thr Glu Thr Thr Gly Asp Leu Thr Phe Thr Gly Lys Gly Tyr Ser Phe 
85 90 95 

Ser Phe Asn Thr Val Asp Ala Gly Ser Asn Ala Gly Ala Ala Ala Ser 
100 105 110 

Thr Thr Ala Asp Lys Ala Leu He Phe Thr Gly Phe Ser Asn Leu Ser 
115 120 125 

Phe He Ala Ala Pro Gly Thr Thr Val Ala Ser Gly Lys Ser Thr Leu 
130 135 140 

Ser Ser Ala Gly Ala Leu Asn Leu Thr Asp Asn Gly Thr He Leu Phe 
145 150 155 160 

Ser Gin Asn Val Ser Asn Glu Ala Asn Asn Asn Gly Gly Ala He Thr 
165 170 175 
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Thr Lys Thr Leu Ser He Ser Gly Asn Thr Ser Ser He Thr Phe Thr 
180 185 190 

Ser Asn Ser Ala Lys Lys Leu Gly Gly Ala He Tyr Ser Ser Ala Ala 
195 200 205 

Ala Ser He Ser Gly Asn Thr Gly Gin Leu Val Phe Met Asn Asn Lys 
210 215 220 

Gly Glu Thr Gly Gly Gly Ala Leu Gly Phe Glu Ala Ser Ser Ser He 
225 230 235 240 

Thr Gin Asn Ser Ser Leu Phe Phe Ser Gly Asn Thr Ala Thr Asp Ala 
245 250 255 

Ala Gly Lys Gly Gly Ala He Tyr Cys Glu Lys Thr Gly Glu Thr Pro 
260 265 270 



Thr Leu Thr He Ser Gly Asn Lys Ser Leu Thr Phe Ala Glu Asn Ser 
275 280 285 



Ser Val Thr Gin Gly Gly Ala He Cys Ala His Gly Leu Asp Leu Ser 
290 295 300 



Ala Ala Gly Pro Thr Leu Phe Ser Asn Asn Arg Cys Gly Asn Thr Ala 
305 310 315 320 

Ala Gly Lys Gly Gly Ala He Ala He Ala Asp Ser Gly Ser Leu Ser 
325 330 335 

Leu Ser Ala Asn Gin Gly Asp He Thr Phe Leu Gly Asn Thr Leu Thr 
340 345 350 

Ser Thr Ser Ala Pro Thr Ser Thr Arg Asn Ala He Tyr Leu Gly Ser 
355 360 365 

Ser Ala Lys He Thr Asn Leu Arg Ala Ala Gin Gly Gin Ser He Tyr 
370 375 380 



Phe Tyr Asp Pro He Ala Ser Asn Thr Thr Gly Ala Ser Asp Val Leu 
385 390 395 400 

Thr He Asn Gin Pro Asp Ser Asn Ser Pro Leu Asp Tyr Ser. Gly Thr 
405 410 415 

He Val Phe Ser Gly Glu Lys Leu Ser Ala Asp Glu Ala Lys Ala Ala 
420 425 430 

Asp Asn Phe Thr Ser He Leu Lys Gin Pro Leu Ala Leu Ala Ser Gly 
435 440 445 

Thr Leu Ala Leu Lys Gly Asn Val Glu Leu Asp Val Asn Gly Phe Thr 
450 455 460 



Gin Thr Glu Gly Ser Thr Leu Leu Met Gin Pro Gly Thr Lys Leu Lys 
465 470 475 480 
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Ala Asp Thr Glu Ala lie Ser Leu Thr Lys Leu Val Val Asp Leu Ser. 

485 490 495 

Ala Leu Glu Gly Asn Lys Ser Val Ser He Glu Thr Ala Gly Ala Asn 
500 505 510 

Lys Thr He Thr Leu Thr Ser Pro Leu Val Phe Gin Asp Ser Ser Gly 
515 520 525 

Asn Phe Tyr Glu Ser His Thr He Asn Gin Ala Phe Thr Gin Pro Leu 
530 535 540 

Val Val Phe Thr Ala Ala Thr Ala Ala Ser Asp He Tyr He Asp Ala 
545 550 555 560 

Leu Leu Thr Ser Pro Val Gin Thr Pro Glu Pro His Tyr Gly Tyr Gin 
565 570 575 

Gly His Trp Glu Ala Thr Trp Ala Asp Thr Ser Thr Ala Lys Ser Gly 
580 585 590 

Thr Met Thr Trp Val Thr Thr Gly Tyr Asn Pro Asn Pro Glu Arg Arg 
595 600 605 

Ala Ser Val Val Pro Asp Ser Leu Trp Ala Ser Phe Thr Asp He Arg 
610 615 620 

Thr Leu Gin Gin He Met Thr Ser Gin Ala Asn Ser He Tyr Gin Gin 
625 630 635 640 

Arg Gly Leu Trp Ala Ser Gly Thr Ala Asn Phe Phe His Lys Asp Lys 
645 650 655 

Ser Gly Thr Asn Gin Ala Phe Arg His Lys Ser Tyr Gly Tyr He Val 
660 665 670 

Gly Gly Ser Ala Glu Asp Phe Ser Glu Asn He Phe Ser Val Ala Phe 
675 680 685 

Cys Gin Leu Phe Gly Lys Asp Lys Asp Leu Phe He Val Glu Asn Thr 
690 695 700 

Ser His Asn Tyr Leu Ala Ser Leu Tyr Leu Gin His Arg Ala Phe Leu 
705 7.10 715 720 

Gly Gly Leu Pro Met Pro Ser Phe Gly Ser He Thr Asp Met Leu Lys 
725 730 735 

Asp He Pro Leu He Leu Asn Ala Gin Leu Ser Tyr Ser Tyr Thr Lys 
740 745 750 

Asn Asp Met Asp Thr Arg Tyr Thr Ser Tyr Pro Glu Ala Gin Gly Ser 
755 760 765 



Trp Thr Asn Asn Ser Gly Ala Leu Glu Leu Gly Gly Ser Leu Ala Leu 
770 775 780 
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Tyr Leu Pro Lys Glu Ala 
785 790 



Pro Phe Phe Gin Gly Tyr Phe Pro Phe Leu 
795 800 



Lys Phe Gin Ala Val Tyr 
805 



Ser Arg Gin Gin Asn Phe Lys Glu Ser Gly 
810 815 



Ala Glu Ala Arg Ala Phe 
820 



Asp Asp Gly Asp Leu Val Asn Cys Ser lie 
825 830 



Pro Val Gly lie Arg Leu 
835 



Glu Lys lie Ser Glu Asp Glu Lys Asn Asn 
840 845 



Phe Glu He Ser Leu Ala 
850 



Tyr He Gly Asp Val Tyr Arg Lys Asn Pro 
855 860 



Arg Ser Arg Thr Ser Leu 
865 870 



Met Val Ser .Gly Ala Ser Trp Thr Ser Leu 
875 880 



Cys Lys Asn Leu Ala Arg 
885 



Gin Ala Phe Leu Ala Ser Ala Gly Ser His 
890 895 



Leu Thr Leu Ser Pro His 
900 



Val Glu Leu Ser Gly Glu Ala Ala Tyr Glu 
905 910 



Leu Arg Gly Ser Ala His 
915 



He Tyr Asn Val Asp Cys Gly Leu Arg Tyr 
920 925 



Ser Phe 
930 



<210> 16 
<211> 293 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 16 

Met Leu Ser Ser Leu He Arg Asp Ser Phe Pro Leu Leu He Leu Leu 
1 5 10 15 

Pro Thr Phe Leu Ala Ala Leu Gly Ala Ser Val Ala Gly Gly Val Met 
20 25 30 

Gly Thr Tyr He Val Val Lys Arg He Val Ser He Ser Gly Ser He 
35 40 45 

Ser His Ala He Leu Gly Gly He Gly Leu Thr Leu Trp He Gin Tyr 
50 55 60 

Lys Leu His Leu Ser Phe Phe Pro Met Tyr Gly Ala He Val Gly Ala 
65 70 75 80 

He Phe Leu Ala Leu Cys He Gly Lys He His Leu Lys Tyr Gin Glu 
85 90 95 
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Arg Glu Asp Ser Leu lie Ala Met He Trp Ser Val Gly Met Ala He 
100 105 110 

Gly He lie Phe He Ser Arg Leu Pro Thr Phe Asn Gly Glu Leu He 
115 120 125 



Asn Phe Leu Phe Gly Asn He Leu 
130 135 

Ser Leu Gly He Phe Asp Leu Leu 
145 150 

His Thr Arg Phe Leu Ala Leu Cys 
165 

Asn His Cys Ser Val Gin Leu Trp 
180 

Ala He Thr He Val Met Leu He 
195 200 



Trp Val Thr Pro Ser Asp Leu Tyr 
140 

Val Leu Gly He Val Val Leu Cys 
155 160 

Phe Asp Glu Arg Tyr Thr Ala Leu 
170 175 

Tyr Phe Leu Leu Leu Val Leu Thr 
185 190 

Tyr Val Met Gly Thr He Leu Met 
205 



Leu Ser Met Leu Val Leu Pro Val 
210 215 

Lys Met Thr Arg He Met Phe He 
225 230 

Ser Phe Ser Gly He Cys He Ala 
245 

Pro Thr He Ser Leu Leu Met Gly 
260 

Val Lys Lys Arg Tyr Asn Pro Ser 
275 280 



Ala He Ala Cys Arg Phe Ser Tyr 
220 

Ser Val Leu Leu Asn He Leu Cys 
235 240 

Tyr Cys Leu Asp Phe Pro Val Gly 
250 255 

Leu Gly Tyr Thr Ala Ser Leu Cys 
265 270 

Thr Pro Ser Pro Val Ser Pro Glu 
285 



He Asn Thr Asn Val 
290 
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